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Puerto Rico’s future at stake 



T his week, Puerto Rico hosts scientists and other 
stakeholders in Caribbean science to discuss 
“Attaining Sustainability Through Science and 
Education.”* This comes at the end of a summer 
that saw the island’s economy “death spiral” into 
a first-ever default on a bond payment. Its more 
than $70 billion in debt, high unemplojmient, 
and a shrinking economy are causing Puerto Ricans to 
leave the island in droves 
in search of stability. This 
includes young scientists 
and biomedical profession- 
als. Austerity measures be- 
ing considered by the Puerto 
Rican government include 
reducing support for higher 
education, but this would 
only drive the brain drain, 
decimate the scientific en- 
terprise, and reduce the 
capacity to confront the eco- 
nomic and environmental 
challenges: a perfect storm 
that would be devastating to 
Puerto Rico’s future. Robust 
science and technology are 
needed now more than ever 
to avoid this catastrophe. 

The economic crisis is 
partly the consequence of a 
history of neglected investment in science-based plan- 
ning for sustainable development. Puerto Rico is still 
strongly dependent on fossil fuels, consuming more to- 
tal barrels of petroleum per day than any of the seven 
Central American countries, while having a smaller 
population than six of them. Transitions from oil to nat- 
ural gas and particularly to renewable energy resources 
have been slow. Decades of watershed mismanagement 
and an inefficient water distribution system have made 
coping with severe drought difficult. And even though 
the risks are evident, the island has not responded to 
the threats of climate change. The Puerto Rico Climate 
Change Council has projected up to 1.70m of sea-level 
rise by 2110, exceeding the upper limit of the global 
mean indicated by the Intergovernmental Panel on Cli- 
mate Change. Yet most of the population, critical infra- 
structure, and economic activity are still concentrated 
in coastal plains within 1 km of the coast. 

About 10 years ago, Puerto Rico recognized that the 
economy of the 21st century is strongly based on science, 
and the government formed the Puerto Rico Science, 



Technology and Research Trust (and Fund), supported by 
public and private sectors. Its goal has been to support 
research, development, and infrastructure projects. The 
trust recently awarded several $150,000 research grants to 
develop new products in biotechnology and life sciences, 
aerospace, medical devices, and information technology. 
The trust also completed a strategic plan to promote en- 
trepreneurship and innovation through infrastructure- 
and capacity-building. 

Puerto Rico also has made 
recent strides to build en- 
vironments that not only 
retain science professionals 
but are fertile for interna- 
tional collaborations. The 
Turabo University’s Tech- 
nology Accelerator, the Bio- 
process Development and 
Training Complex, the Uni- 
versity of Puerto Rico Com- 
prehensive Cancer Center, 
and the future Science City 
are all geared toward this 
end. The University of Puerto 
Rico’s Molecular Sciences 
Research Center opened in 
2011, supporting basic and 
translational biomedical re- 
search through interactions 
between academia and the 
biopharmaceutical industry and equipped to license and 
commercialize its intellectual property. The University of 
Puerto Rico’s new joint science and biomedicine train- 
ing and development programs for graduate students 
and faculty are in partnership with institutions includ- 
ing Stanford and Yale Universities. The island’s scientific 
diaspora is involved in all of these efforts, particularly 
through Ciencia Puerto Rico, a network with over 7000 
members interested in science and Puerto Rico. 

In June, Puerto Rico— A Way Forward, a report by for- 
mer International Monetary Fund economists (commis- 
sioned by the government), indicated that drastic cuts 
in public spending are needed, including education. Al- 
though restructuring the debt and eliminating the waste- 
ful use of government resources are certainly necessary, 
slashing education is not a viable solution for an economy 
whose recovery relies on scientific innovation activities. 
Science investment in a time of economic turmoil might 
seem like a luxury. However, it is a gateway toward eco- 
nomic and social development. 

- Jorge Colon 




“...reducing support for higher 
education...would be devastating 
to Puerto Rico’s future.” 




Jorge Colon is 
a professor in 
the Department 
of Chemistry at 
the University of 
Puerto Rico, San 
Juan, Puerto Rico, 
and a member of 
Ciencia Puerto 
Rico. E-mail: 
jorge.colonlO@ 
upr.edu 



*Annual Conference of the Caribbean Division of the American Association for the Advancement of Science (AAAS) 

(http://aaascd.rcm.upr.edu/). AAAS is the publisher of Sc/ence. 10.1126/science.aad3620 
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Y Wang, Y Zhang and Y Li make every other academic 
look like a slacker. Each has published around 30,000 papers in 
just 10 years, for an average of nearly 9 papers a day. 55 

From a recent paper in Scientometrics showing how ambiguity in author names has 
compromised Thomson Reuters’s ranking of the most cited researchers. 




Polio resurfaces in West Africa and Ukraine 




Low vaccination rates enabled two polio outbreaks, in Ukraine and in Mali; for each, the culprit was a 
virus derived from the live oral vaccine (shown here in Pakistan). 



C ountries in Ebola-ravaged West Africa are on high alert after polio 
was confirmed in a toddler in Bamako, Mali, on 7 September, just 
a week after two cases were reported in southwestern Ukraine. 
The outbreaks are unrelated, but in both instances the culprit 
is a vaccine-derived poliovirus (VDPV). Rare VDPVs arise when 
the weakened form of a virus used in the live vaccine regains its 
virulence— a danger when vaccination rates are low, which allows the 
strain to circulate and accumulate genetic mutations. In West Africa, a 
19-month-old boy was paralyzed in Guinea on 20 July and traveled to 
Mali for treatment, where a type 2 VDPV was confirmed. With health 
systems and routine immunization in West Africa decimated due to the 
Ebola outbreak, the World Health Organization (WHO) deems the risk 
of spread high. In Ukraine, two children— a 4-year-old and a 10-month- 
old— were paralyzed on 30 June and 7 July. VDPV type 1 was confirmed 
31 August. The risk of spread within Ukraine— where social tumult led 
to falling vaccination rates— is also high, WHO says, but the risk of inter- 
national spread is low. Emergency efforts to fight the outbreaks with mass 
vaccination campaigns could begin this week. http://scim.ag/poiioUkraineMaii 



AROUND THE WORLD 

Grizzly obesity study retracted 

THOUSAND OAKS, CALIFORNIA | Last year, 
researchers announced in Cell Metabolism 
that they had learned how grizzly bears gain 
weight prior to hibernation— without devel- 
oping obesity-associated health problems 
like diabetes. But the journal retracted the 
paper last week after some of its authors 
at Amgen in Thousand Oaks, California, 
announced that the original paper contained 
falsified data. No one outside of Amgen 
knows who tampered with the findings, but 
the scientist has since been let go, according 
to a report in The Wall Street Journal. The 
academic co-authors on the study insist that 
the findings are still viable, and say they are 
currently replicating the falsified portion of 
the study to verify them. 

Mekong forests dwindling fastest 

WASHINGTON, D.c. | Earth lost 18 mil- 
lion hectares of trees in 2014, according 
to a report last week by the Global Forest 
Watch (GFW)— but not in the most 
expected places. Economic factors and new 
laws have slowed the forests’ demise in 
Brazil and Indonesia, the top tree-clearing 
countries, reducing their share of global 
forest loss from 53% in 2001 to 38% in 
2014. Meanwhile, the rate of tree cover loss 
is increasing in Cambodia, Vietnam, and 
other Mekong Delta countries, at rates five 
times faster than in other tropical countries. 
Sierra Leone and Madagascar are not far 
behind, in part because of the development 
of oil palm plantations. And dry tropical 
forests in Paraguay, Argentina, and Bolivia 
are also in trouble. GFW’s estimate of 
total hectares lost roughly agrees with an 
estimate of 15 billion trees per year reported 
last week in Nature. But critics note that 
neither number factors in new trees being 
replanted or growing back naturally. 

o 

o 

Human subject rules revamped | 

X 

WASHINGTON, D.c. | Sixteen U.S. agcu- < 

cies, including the Department of Health < 
and Human Services (HHS), are moving to ^ 
tighten rules intended to protect human F 
research subjects. A notice released last I 



1146 11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 



sciencemag.org SCIENCE 



Published by AAAS 




PHOTOS: (TOP TO BOTTOM) © WENN LTD/ALAMY STOOK PHOTO; EUROPIOS/NEWSOOM 



week fleshes out intended changes to the 
Common Rule, a set of ethical require- 
ments that apply to studies funded by 
the federal government as well as many 
private entities. One change would require 
the consent of participants prior to any 
analyses of their stored blood and tissue 
samples for additional studies unrelated to 
the original research, even if the samples 
are stripped of identifying information. 
Another change would simplify the often 
sprawling informed consent forms by mov- 
ing all but the most essential components 
into an appendix. The new rules would 
also require that a research project span- 
ning multiple institutions be consolidated 
under a single institutional review board. 
HHS and 15 other agencies will accept pub- 
lic comments for 90 days before releasing 
a final rule. http://scim.ag/CommRule 

Long-distance robot control 

NOORDWIJK, THE NETHERLANDS | In vidcO 
games, feeling the response of a simu- 
lated plane to a joystick or the recoil of an 
onscreen gun is called force feedback, or 
“haptic” technology. Haptic feedback is cru- 
cial for space robotics, too: The ability to 
“feel” helps controllers guide robot arms to 
do delicate tasks. This week, the European 
Space Agency (ESA) put this technology to 
the test, when Danish astronaut Andreas 
Mogensen drove a rover around the 
grounds of ESA’s technology center— from 
the International Space Station, 400 kilo- 
meters above Earth. Mogensen took the 
wheel of Interact Centaur, a rover equipped 
with cameras, location sensors, and two 
robot arms. Signals from Mogensen’s 
handheld controllers traveled through a 
geostationary satellite to the rover, a total 
journey of 90,000 kilometers. Mogensen 
directed the rover to do tasks with sub- 
millimeter precision, including finding a 
“task board” where it pulled out and rein- 
serted a pin into a hole. 

Trained monkeys destroy nests 

BEIJING I In an effort to prevent dam- 
age to military aircraft due to accidental 
bird strikes, China’s People’s Liberation 
Army has trained a squadron of male 
rhesus macaques to find and destroy 
birds’ nests. The effort, which focused on 
an airfield located along a major migra- 
tion route for birds known as the East 
Asian-Australasian Flyway was planned 
in advance of an airshow with jet fighters 
on 3 September, the 70th anniversary of 
Japan’s surrender at the end of World War 
II. The macaques scramble up trees and 
dismantle nests by pulling out twigs; two 

SCIENCE sciencemag.org 




Help for the spotted handfish 



T he charmingly awkward spotted handfish (Brachionichthys hirsutus) isn’t an 
avid swimmer: it prefers to clamber along the sandy bottom on its oddly shaped 
pectoral fins. Last month, the most comprehensive survey to date of this critically 
endangered species native to Tasmania, Australia, found just 79 handfish within its 
remaining habitat in the Derwent River and estuary, with three or fewer at several 
locations. Pollution and silt clouding the waters from farming and construction are par- 
tially responsible for its decline. But another major culprit is an invasive sea star that eats 
stalked sea squirts. That’s a problem because the handfish like to lay their eggs around 
these squirts. Biologists with the Commonwealth Scientific and Industrial Research 
Organisation and the University of Tasmania are trying to recreate the handfish’s habitat 
by putting pieces of plastic into the sediment to replace the eaten sea squirts. The sci- 
entists will meet with Australia’s Department of the Environment next month to discuss 
setting up a captive breeding population in case the remaining wild fish go extinct. 



monkeys dismantled about 180 nests over 
1 month, a Chinese airbase commander 
told China Daily in May. Ornithologists, 
however, say the operation is misguided 
at best, and could cause stress and lead to 
higher mortality among migrating birds 
that stop in Beijing on their way south for 
the winter, http://scim.ag/monkeynests 




China’s People’s Liberation Army Air Force has trained 
a group of macaques to drive away birds from airfields. 



FINDINGS 

Alzheimer’s protein: contagious? 

From 1958 to 1985, 30,000 people world- 
wide— mostly children— received injections 
of human growth hormone extracted from 
the pituitary glands of human cadavers. 

The procedure was halted when researchers 
found that a small percentage of recipients 
had received contaminated injections and 
were developing Creutzfeldt-Jakob Disease 
(CJD), a fatal neurodegenerative condi- 
tion caused by misfolded proteins called 
prions. This week, a study of the brains of 
eight deceased people who had contracted 
CJD from such injections suggests that the 
injections may also have spread amyloid- (3, 
the neuron-clogging protein that is the hall- 
mark of Alzheimer’s disease. The study is 
the first evidence in humans that amyloid-p 
could be transmitted through invasive 
medical procedures such as brain surgery 
or blood transfusion, the team writes this 
week in Nature. Skeptics, however, note 
that the CJD prion itself often triggers 
unusual amyloid deposits; epidemiological 
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studies, they say, show no connection 
between the injections and increased risk 
of developing Alzheimer’s disease. 

http://scim.ag/AlzheimersCJD 

New light on Basque’s origins 

Many researchers have assumed that 
Basque— a language spoken by a group 
of people between Spain and France and 
with no known relationship to any other 
tongue— is a “relic language” spoken by the 
hunter-gatherers who occupied Western 
Europe before farmers moved in about 
7500 years ago. But a new study this week 
in the Proceedings of the National Academy 
of Sciences contradicts this idea. The 
researchers sequenced the genomes of eight 
skeletons found in a Basque country cave 
called El Portalon in northern Spain. The 
skeletons date to between 5500 and 3500 
years ago and thus belonged to the farmers. 
The team compared these genomes with 
genomes from other skeletons spanning 
hunter-gatherer to early farming periods 
in Western and Central Europe, from 8000 



to 5000 years ago. They also compared the 
ancient DNA with more than 2000 genomes 
from modern Europeans. Today’s Basques, 
the scientists found, were most closely 
related to those early farmers, who then 
became isolated from later migration waves. 
http://scim.ag/BasqueOrigins 



NEWSMAKERS 

Three Q’s 

Soumya Swaminathan, a pediatrician 
who has spent 2 decades working on 
HIV-associated tuberculosis, is the new 
head of the Indian Council of Medical 
Research (ICMR), India’s premier biomedi- 
cal research organization. She talked with 
Science about her plans for the council. 

Q: What’s India’s biggest health challenge? 

A: Getting quality health care to people 
who need it most— urban slum dwell- 
ers, tribal and remote rural populations. 
We have a high burden of communicable 
diseases, [including] tuberculosis, malaria. 



and diarrhea, with increasing rates of 
noncommunicable diseases [such as] 
hypertension, diabetes, mental health 
problems. Undernutrition is a major factor. 

Q: How can ICMR make a difference? 

A: We need to develop, test, and validate 
indigenous diagnostics for common 
diseases and use modern communications 
technology. For too long, we have ignored 
the role that communities and ordinary 
citizens can play in preventive and promo- 
tive health care. [ICMR] will focus on 
translational and implementation research 
to improve access to quality health care. 

Q: What about funding levels? 

A: Biomedical research definitely needs 
more funding from the government. We 
can [also] leverage existing resources more 
effectively by strengthening partnerships 
with other Indian science agencies and 
with international organizations like the 
NIH, CDC, and MRC. I also propose to 
explore partnerships with the private 
sector and philanthropic organizations. 



BY THE NUMBERS 

6190 

Height, in meters, of Alaska’s Denali, 
the tallest peak in North America, 
according to a new measurement by 
the U.S. Geological Survey. The previous 
accepted elevation, made using 1950s- 
era technology, was 6194 meters. 

0 

Number of HIV infections that 
occurred over 2 years in 657 people in 
San Francisco, California, who took a 
daily antiretroviral drug as prophylaxis. 

300 

Number of induced pluripotent stem 
cell lines— the largest publicly available 
collection— available through a 
new stem cell bank launched last week 
by the Coriell Institute for Medical 
Research in Camden, New Jersey. 




Satellite’s high-resolution mapping instrument fails 

A radar instrument on NASA’s Soil Moisture Active Passive (SMAP) mission, 
designed for high-resolution mapping of the moisture content in the topmost 
5 centimeters of Earth’s soil, has failed because of a problem with its ampli- 
fier, the agency announced on 2 September. With an ability to map moisture in 
areas as small as 9 kilometers across, the radar was supposed to help improve 
forecast models for weather, drought, and flooding. But engineers have exhausted 
possibilities for resurrecting the instrument after it first stopped working on 7 July. 
The other main instrument on the $916 million mission, which launched in January 
(http://scim.ag/SMAPIaunch), is still operational, but that instrument, a passive radi- 
ometer, has lower resolution, at about 40 kilometers across. Scientists had wanted to 
use SMAP to understand how some storms seem to depend on alternating wet and 
dry patches of soil at distances of 10 kilometers. 
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IN DEPTH 



Analyzing the more than 1500 
fossils, representing more than 
15 individuals, was a group effort 
at a workshop in South Africa. 



PALEOANTHROPOLOGY 

New human species discovered 

Bizarre skeletons emerge from South African cave 



By Ann Gibbons 

F rom the moment in 2013 when paleo- 
anthropologist Lee Berger posted a plea 
on Facebook, Twitter, and Linkedin 
for “tiny and small, specialised cavers 
and spelunkers with excellent archaeo- 
logical, palaeontological and excava- 
tion skills,” some experts began grumbling 
that the excavation of a mysterious hom- 
inin in the Rising Star Cave in South Africa 
was more of a media circus than a serious 
scientific expedition. Daily blogs recorded 
the dangerous maneuvers of “underground 
astronauts” who squeezed through a long, 
narrow chute to drop 30 meters into a fossil- 
filled cavern, while Berger, who is based at 
the University of the Witwatersrand in Jo- 
hannesburg, South Africa, became the “voice 
from the cave” in radio interviews. 

When it came time to analyze the fossils, 
Berger put out a call for applications from 
“early career scientists” to study them at a 
workshop in Johannesburg in March 2014. 
Handing over much of the analysis of such 
potentially important specimens to inex- 
perienced researchers didn’t inspire con- 
fidence among Berger’s colleagues either, 
though it did win him the nickname Mr. 
Paleodemocracy. Many thought the expedi- 
tion “had more hype than substance,” paleo- 
anthropologist Chris Stringer of the Natural 



History Museum in London writes in a com- 
mentary accompanying the fossils’ official 
presentation this week in the journal eLIFE. 

But the substance has now been unveiled, 
and few dispute that the findings are impres- 
sive. In their report, Berger and his team 
describe 1550 fossils representing more than 
15 ancient members of a 
strange new kind of hom- 
inin, which they named 
Homo naledi. (Naledi means 
“star” in the Sotho language 
spoken in the region of the 
cave.) It is the largest trove 
of fossils of a hominin ever 
found in Africa— and more 
await excavation at the site, 

50 kilometers northwest of 
Johannesburg. “Lee has to 
be congratulated for finding 
this stuff,” says Fred Grine, a 
paleoanthropologist at the 
State University of New York 
at Stony Brook. 

Although the fossils are still undated, 
making it hard to know where they sit in the 
human family tree, they already reveal a pro- 
foundly different way to be a member of our 
genus Homo. “There is no doubt in my mind 
that this is a new species,” Grine says. 

As soon as spelunkers led by Pedro 
Boshoff, a former student of Berger’s, found 



a handful of ancient teeth and a jaw deep in 
the cave’s Dinaledi Chamber in September 
2013, it was clear that “something bizarre 
was going on,” Berger says. This ancient in- 
dividual had small front teeth like a modern 
human but larger molars like a more primi- 
tive human ancestor. 

When the spelunkers brought up more 
remains, Berger’s perplexity deepened. The 
skull was globular, like a member of our 
genus Homo, but the brain was small and 
primitive. The wrist suggested this species 
was an adept toolmaker, but its shoulder and 
fingers showed it still climbed in trees, like 
more primitive hominins. 
‘We were facing something 
that was different from any- 
thing else that had been de- 
scribed,” Berger says. 

At first, he worried that 
something was wrong with 
the excavation or that the 
teeth came from more 
than one type of hominin. 
Much to his frustration, he 
couldn’t check the dig for 
himself, because he couldn’t 
squeeze down the 20-cen- 
timeter-wide natural chute 
that led into the cavern (see 
diagram, p. 1150). With a quick grant from 
the National Geographic Society, where he is 
an explorer-in-residence, Berger assembled a 
team of thin, young researchers and cavers to 
excavate further. 

“As the teeth came up, you’d see this odd 
tooth; then the same thing in two, three, five, 
10, 50, 100 teeth,” Berger says. “No individual 
part varied.” Other remains— skulls, hands. 




Cavers found 26 of the 27 bones in 
this hand, which has modern and 
primitive features. 
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and foot bones from infants, teens, and the 
elderly— also consistently showed the same 
unusual combination of modern and apelike 
traits. But although the similarity of the fos- 
sils convinced the researchers that they had 
uncovered a new species, anal}^ing all 1550 
was a Herculean task. “No one had ever tried 
to publish more than 100 fossils of a single 
species at once,” Berger says. 

In the end, the work of more than 60 re- 
searchers yielded a picture of “a relatively 
tall, skinny hominid with long legs, human- 
like feet, with a core and shoulder that is 
primitive,” Berger says. Some body parts have 
come into sharper focus than others. In an 
analysis of the remarkably complete hands, 
paleoanthropologist Tracy Kivell of the Uni- 
versity of Kent in the United Kingdom found 
that bones in the wrist were shaped like 
those in modern humans, suggesting that 
the palm at the base of the thumb was quite 
stiff. That would allow forces to dissipate 
over a larger area of the hand than in more 
primitive humans— a trait associated with 
tool use. At the same time, H. naledi had a 
weird thumb and long, curving fingers, as if 
it still spent a lot of time climbing. 

In a separate analysis of the foot, paleo- 
anthropologist William Harcourt-Smith of 
the American Museum of Natural History 
in New York City and his colleagues noted 
that its proportions were modern and the big 
toe was aligned with the other toes, rather 
than diverging like an ape’s grasping big toe. 
Taken together, the hand and foot suggest 
“you have a creature that would have walked 
upright really well but also would have been 
comfortable in the trees,” Harcourt-Smith 
says. Harvard University paleoanthropologist 
Dan Lieberman agrees: “The foot is indeed 
strikingly modern ... and suggests it walked 
diXid. possibly ran much like modern humans.” 



No other animals were found in the sub- 
terranean chamber except a hapless owl and 
a few rodents. As a result, the team rules 
out the idea that animals dragged the hu- 
man remains into the cave. Instead, they 
suggest that other humans, perhaps modern 
humans, deposited the bodies there— as in a 
crypt. But the team can’t rule out alternate 
explanations— for example, that a group of 
H. naledi found their way into the cave but 
got lost or trapped. 

Despite the wealth of data, the initial anal- 
ysis invites more questions than it answers. 
Stringer writes, including “Where does 
H. naledi fit in the scheme of human evolu- 
tion?” Determining the age of the bones or 
cave would help answer that, but caves are 
notoriously difficult to date, and the bones 
may be too old for radiocarbon dating. 

Still, “it is absolutely essential that the re- 
mains be dated,” says paleoanthropologist 
Curtis Marean of Arizona State University, 
Tempe. “For example, if they date to the last 
300,000 years, then it is plausible that early 
modern humans killed them and stashed 
them in the cave as part of a ritual.” A date 
would also illuminate whether the fossils 
represent an ancient population or an iso- 
lated relic that persisted until fairly recently, 
says paleoanthropologist Bernard Wood of 
George Washington University in Washing- 
ton, D.C. 

Berger says now that his team has de- 
scribed the fossils, they can focus on dating 
them, trying to extract DNA, and extending 
his exercise in paleodemocracy by recruit- 
ing still more young researchers to the ef- 
fort. “The Rising Star expedition isn’t a 
media stunt,” Berger says. “I felt it was 
the first time in history to bring the entire 
world into a place where we discovered a 
hominid fossil.” ■ 



MEDICAL NANOTECHNOLOGY 

Spherical 
nucleic acids 
start rolling 

Promise of cancer-seeking 
nanoparticles wins funding 
for research center 

By Robert F. Service 

T he shape of gene therapy to come 
may be round. Last week, the US. Na- 
tional Cancer Institute (NCI) awarded 
Northwestern University in Evan- 
ston, Illinois, $11.7 million to pursue 
nanoparticles called spherical nucleic 
acids (SNAs) as a new avenue for treating 
everything from cancer to psoriasis. The 
particles, typically consisting of an inert core 
carpeted with DNA or RNA, have already 
proved biologically potent, able to switch 
genes on and off and kick the immune sys- 
tem into action. 

The institute is one of six centers funded 
last week as part of NCI’s third 5-year cancer 
nanotechnology program, but the only one 
devoted to SNAs. And the grant marks what 
could be a turning point for these versatile 
particles. After decades of test tube and ani- 
mal experiments, they are now about to en- 
ter human trials. 

SNAs found their way to the clinic through 
the back door. Northwestern chemist Chad 
Mirkin and colleagues made the first ones 
in 1996, from tiny gold nanoparticles, 
each festooned with about 100 identical 
short strands of DNA or RNA. At the time, 
Mirkin’s team wanted to build new 
materials by harnessing nucleic acids’ skills 
at binding complementary strands to as- 
semble nanoparticles into 3D structures. The 
effort succeeded, but in the process, Mirkin 
says, “we discovered [SNAs] have other very 
unusual and interesting properties.” 

For starters, SNAs can seek out and bind 
to very low quantities of complementary 
DNA and RNA. They do that even better 
than linear nucleic acids because each SNA 
harbors so many nucleic acid copies. That 
knack has made them ideal as nucleic acid 
detectors. Nanosphere, a biotech company 
that Mirkin co-founded in Northbrook, 
Illinois, has exploited it to develop SNA 
tests that can identify pathogenic bacteria 
and viruses in blood and other samples 
without the need for the time-consuming 



Treasure map 

South Africa’s Rising Star Cave yielded a trove of bones from what turned out to be a new 
Homo species. Excavating them was a task for small, slim cavers. 
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In a cell, an SNA (red) is as effective as a special stain (green) at highlighting DNA-containing mitochondria. Yellow shows where the methods overlap. 



process of culturing microbes. 

Mirkin’s team also discovered that SNAs 
readily enter cells— a complete surprise, he 
says. Because DNA and cell membranes are 
both negatively charged, he assumed they 
would repel each other, as happens when 
free-floating linear DNA encounters a cell 
membrane. Yet nearly all cell types readily 
engulf SNAs in compartments called endo- 
somes, which trap materials for possible 
destruction. Mirkin’s team found that many 
SNAs manage to escape into the cell body, 
although researchers still don’t understand 
just how they slip the net. Once there, an 
SNA’s strands of DNA or RNA can bind to 
complementary DNA or RNA in the cell, 
detecting or even silencing genes involved 
in disease. 

In 2012, for example, Mirkin and col- 
leagues reported that when they applied 
SNAs to the skin of mice, skin cells readily 
took up the nanoparticles, which knocked 
down the expression of genes normally up- 
regulated in psoriasis. A similar skin study 
showed promise for speeding wound healing. 

The particles are proving equally adept 
when injected into the body. In one such 
study in Science Translational Medicine in 
2013, SNAs were programmed to home in 
on specific RNAs in patients with glioblas- 
toma multiforme, one of the deadliest forms 
of brain cancer. The targeted RNAs speed 
cancer growth by helping make proteins 
that turn off a tumor suppressor gene in the 
cancer cells. When injected into the tail veins 
of mice with glioblastoma, the SNAs crossed 
the blood-brain barrier, bound to their target 



RNAs, and knocked down the expression of 
the antitumor suppressor protein, lengthen- 
ing the life span of the mice. 

More recently, the researchers have tested 
SNAs as cancer vaccines, which train im- 
mune cells to seek out and destroy malignan- 
cies. In the 31 March issue of the Proceedings 
of the National Academy of Sciences, Mirkin 
and his colleagues at Northwestern and Au- 
raSense Technologies, a SNA drug startup 
company since renamed Exicure, re- 
ported that they had created SNAs 
that could bind to a cancer-cell 
protein and direct it to a key 
immune-cell receptor. The 
SNAs carried short frag- 
ments from a protein 
called ovalbumin (Ova), 
which is overexpressed 
in cells with lymphoma, 
a type of blood cell cancer. 

The particles were tailored 
to home in on so-called toll- 
like receptors, which are key to 
getting immune cells to target 
foreign peptides. In mice, the 
SNAs coaxed immune cells to 
attack Ova-containing cells, reduced the size 
of lymphoma tumors, and lengthened sur- 
vival. “We found that the mice eventually do 
die of cancer. But when we took the tumors 
out, there were no Ova-containing cells left,” 
Mirkin says. 

Lately, the Northwestern team has tried 
combining immune-stimulating SNAs with a 
new class of cancer drugs called checkpoint 
inhibitors. These take aim at immune cell 



proteins called checkpoints, which inhibit im- 
mune responses. Disabling the checkpoints 
can unleash immune cells to destroy cancer 
{Science, 20 December 2013, p. 1432). At the 
American Chemical Society (ACS) meeting 
last month in Boston, Mirkin reported pre- 
liminary evidence that when he combined 
his immune-boosting SNAs with a conven- 
tional checkpoint inhibitor, they eliminated 
tumors in a small number of animals models 
of breast cancer and melanoma. 

Such successes are “really ex- 
citing,” says Karen Wooley a 
chemist at Texas A&M Uni- 
versity, College Station, 
who heard Mirkin detail 
his latest work at the ACS 
meeting. Robert Langer, a 
nanotechnology expert at 
the Massachusetts Insti- 
tute of Technology in Cam- 
bridge, agrees. “Chad’s work 
is excellent,” he says. 

Work funded by the NCI 
grant could soon extend such 
successes to human patients. 
Mirkin says human clinical tri- 
als with glioblastoma patients are expected 
to begin in the United States this fall. And by 
next spring, the team expects to begin similar 
trials with psoriasis patients in Europe. 

If those prove successful, Mirkin says his 
team could roll out a host of new trials in 
short order. After all, Mirkin says, “to target 
novel diseases we don’t need to come up with 
new molecules, we just need to change the 
[nucleic acid] sequence.” ■ 




Tendrils of nucleic acids 
enable SNAs to bind to genes 
or RNA in cells. 



SCIENCE sciencemag.org 



11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 1151 



Published by AAAS 






CLIMATE CHANGE 

A cold, creeping menace 

In Alaska, frozen debris lobes threaten a key lifeline 



By Eli Kintisch, in Fairbanks, Alaska 

B arack Obama last week became the 
first sitting US. president to visit the 
nation’s Arctic territory, using a so- 
journ in this vast state to highlight the 
risks posed by climate change. They 
include, Obama said, the accelerated 
thawing of the frozen permafrost that sits be- 
neath “the earth on which 100,000 Alaskans 
live, threatening homes [and] damaging 
transportation and energy infrastructure.” 

It was a familiar warning to geological 
engineer Margaret Darrow of the University 
of Alaska, Fairbanks. Just before Obama ar- 
rived, she had spent a week in the field with 
a research team seeking to better understand 
one emerging hazard caused by thawing 
permafrost: a massive, slow-motion landslide 
that threatens the Dalton Highway, the only 
land route into Alaska’s lucrative North Slope 
oil fields and arguably the state’s most impor- 
tant road. Researchers have so far identified 
43 of these frozen debris lobes (FDLs) along 
the highway’s route through the rugged 
Brooks Range— including a 20-meter-high 
slide that has bulldozed its way to within 40 
meters of the northbound lane. If the slide, 
dubbed FDL-A, reaches the road, it would 
dump some 45 metric tons of debris daily- 
more than enough to fill a dump truck. 

The slides create what looks “like a war 
zone,” Darrow says: cracked earth, toppled 
trees, and layers of moss and debris wrapped 
into jelly rolls. When it’s quiet, she adds, you 
can hear trees straining as the mass creeps 
along at an average of 1.4 centimeters a day. 
As Alaska’s warming softens their icy under- 
pinnings, FDLs are now “a serious impend- 
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ing geohazard,” Darrow says, noting that 
FDL-A’s speed has increased by 40% in the 
past 7 years. 

FDL’s didn’t get much attention in 1974, 
when engineers built the gravel, 666-kilo- 
meter-long Dalton Highway from outside 
Fairbanks to Deadhorse, near the Arctic 
Ocean, in order to service the Trans-Alaska 
Pipeline System, which parallels the road. 
Geologists knew the FDLs existed on adja- 
cent slopes, but they were barely moving. 
Then, in 2008, satellite photos revealed that 
a mass of frozen earth about 260 meters 
wide and 1200 meters long had inched to 
within 150 meters of the road at milepost 
219, near the town of Coldfoot. Soon, state 
and federal officials were asking research- 
ers to take a closer look at FDL-A, and oth- 
ers like it— and to consider ways of stopping 
them in their tracks. 

Since then, scientists have used airborne 
radar and other sensors to create 3D maps 
of the slides and estimate how fast they are 
moving. On the ground, they’re drilling cores 
through the FDLs to see inside. In a bid to 
understand why some lobes are moving 
faster than others, they have measured slope 
angles and identified soil types. 

There have been plenty of surprises. One 
was the liquid water that gurgled up from 
under FDL-A in 2012, as scientists probed 
the critical “shear zone” in the permafrost 
beneath the slide. “You see water com- 
ing out and you’re like: ‘What the heck?”’ 
recalls Darrow, who says the liquid likely 
seeps in from the surrounding soil and re- 
mains supercooled in the frozen ground. 
Its high pressure— 155,000 kilograms per 
square meter— enables it to lubricate the 



Scientists are unsure how to stop massive frozen 
landslides inching toward Alaska’s Dalton Highway. 

FDL’s downhill slide, in the same way a 
thin layer of water enables a hockey player’s 
skate blade to slide across the ice, says state 
geohydrologist Ronald Daanen in Fair- 
banks. As Arctic soil temperatures rise, ba- 
sic physics holds that the pressure— and the 
lubrication— will increase, he adds. “Just a 
little more [water] pressure and [the lobes] 
will be floating,” allowing them to slide 
faster, Daanen says. 

That worrying scenario could also hold 
true for other permafrost formations found 
in Alaska on steep slopes. Daanen says 
vegetation often holds in place these piles, 
which are slower moving, but “I’m afraid 
with warming temperatures there’s a tipping 
point after which gravity takes over.” 

In the meantime, researchers are mulling 
ways to defend the Dalton Highway from 
FDL-A. (The pipeline, some 800 meters 
away, is not at risk in the near-term.) Dig- 
ging a trench to funnel water away from the 
lubricating shear zone is one idea. But re- 
searchers aren’t sure how to best configure 
the ditch, because they have data only from 
the 2012 borehole. Engineers have also en- 
visioned a buttress wall to block the slide, 
but Darrow fears it might be little more 
than “a speed bump” for the massive lobe. A 
more radical idea is to install underground 
cooling pipes to freeze the slide in place. ^ 
But the immense forces could rip apart the I 
pipes, and they may not work as intended, § 
Daanen says. For starters, the state plans to g 
simply reroute the road “to buy some time,” | 
says Michael Coffey of Alaska’s Transporta- ^ 
tion Department in Juneau. | 

Coffey estimates that, at current move- < 
ment rates, it could take 10 years for FDL-A > 
to reach the highway. But researchers say it | 
could speed up given the unpredictable be- | 
havior of other FDLs, which have moved as g 

o 

much as 10 times as fast. ■ I 
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BIOSECURITY 

Flu study raises questions about U.S. ban 

A new viral “backbone” could speed up production of influenza vaccines 



By Jon Cohen 

W hen the U.S. government tempo- 
rarily halted a class of risky ex- 
periments on influenza, SARS, 
and Middle East respiratory syn- 
drome last year, some research- 
ers warned that the move could 
harm public health rather than help it. 
The far-reaching moratorium, which tar- 
geted studies that make the viruses more 
transmissible or pathogenic, would also 
impede low-risk research that could help 
fight pandemics, they argued. 

Now, one of the researchers whose work 
triggered the moratorium has pub- 
lished a paper that he says underscores 
that point. In the new study, published 
online in Nature Communications on 
2 September, Yoshihiro Kawaoka of 
the University of Wisconsin, Madison, 
engineered heartier influenza viruses 
to streamline vaccine production, 
which he says could help bring future 
pandemics under control. 

The study, which was partly funded 
by the U.S. National Institute of Al- 
lergy and Infectious Diseases (NIAID), 
was completed before the morato- 
rium took effect. But Kawaoka says it 
couldn’t have been done today— even 
though it carried little risk. 

NIAID Director Anthony Fauci 
disputes the argument. “It is very 
likely” that Kawaoka’s group would 
have been granted an exemption— 
and would have continued to receive 
NIAID support— because the study spe- 
cifically aims to improve vaccines against 
influenza, Fauci says. But others say the 
cumbersome exemption process could de- 
ter such work. One lab that received an ex- 
emption for similar work told Science that 
the process took 3 months. 

Kawaoka published one of two highly 
controversial papers in 2012 that showed 
how so-called gain-of-function (GOF) mu- 
tations made the highly lethal H5N1 avian 
influenza virus more transmissible, and 
hence potentially more capable of causing a 
pandemic. The ensuing furor contributed to 
the U.S. government’s decision to announce 
a 1-year funding pause in October 2014 of 
GOF experiments that could be “reasonably 
anticipated” to increase transmissibility or 
pathogenicity. Laboratories were also asked 
to halt ongoing GOF studies. 



Before the moratorium, Kawaoka’s team 
had set out to address a perennial problem 
in the production of influenza vaccines: 
the scramble to produce them in time for 
each year’s flu season. To keep up with the 
ever-mutating flu virus, researchers have 
to create new vaccines each year, tailor- 
made to combat the strains they expect to 
dominate— and rush them into production. 

The viruses used to make vaccines tyi^i- 
cally contain a “backbone” of six genes 
plus two variable genes that code for the 
proteins studding the viral surface. These 
surface proteins are critical for provoking 
a protective immune response, so vaccine- 



makers combine the surface proteins 
currently in circulation with a standard 
backbone. The companies then grow these 
viruses in eggs or in mammalian cells. 

In the new study, Kawaoka’s team ran- 
domly mutated the backbone genes to 
create a large library of variants and then 
tested them with many different surface 
proteins. They found a novel backbone 
that— for reasons they have not been able 
to pin down— increased the yield of the 
viruses in eggs and mammalian cells, by 
more than 200-fold in some cases. 

The study “tackles a question that is of 
great pragmatic importance,” says viro- 
logist Gary Nabel, chief scientific officer 
at flu vaccine producer Sanofi in Cam- 
bridge, Massachusetts. “If you can’t make 
enough vaccine at an acceptable cost, it 
won’t be available to many who need it.” 



Fauci notes the new work is especially 
relevant because it sped up growth in 
mammalian cells. That’s a faster vaccine 
production system than chicken eggs, and 
it avoids mutations that occur when the vi- 
rus adapts to eggs, which can compromise 
vaccine effectiveness. 

Kawaoka stresses that the experiment 
did not create dangerous viruses. Flu vi- 
ruses with the new backbone only mar- 
ginally increased the severity of disease in 
mice, the team found— and manufacturers 
routinely kill the virus as part of the vac- 
cine production process in any case. Nor 
could anyone glean information from the 
paper that might help construct more 
■ deadly human influenza viruses, 
Kawaoka says. “We have no idea 
whether these mutations would make 
other influenza viruses more patho- 
genic or transmissible,” he says. “I 
don’t think people who are seriously 
concerned about [GOF studies] are 
concerned about this type of work,” 
he says. “The net was cast too wide.” 
Fauci’s support for studies such as 
Kawaoka’s does not address a broader 
issue, says former National Sci- 
ence Advisory Board for Biosecurity 
(NSABB) member Arturo Casadevall, 
an immunologist at the Johns 
Hopkins Bloomberg School of Pub- 
lic Health in Baltimore, Maryland. 
“Moratoriums and pauses are blunt 
instruments,” Casadevall wrote in an 
email. “I worry that even if exclusions 
are made for certain experiments 
so that they continue, the GOF contro- 
versy combined with its pauses/moratori- 
ums has already created an environment 
where scientists may be discouraged from 
experimental work that is clearly in the 
public interest.” 

The U.S. government has asked NSABB 
to review the risks and benefits of GOF 
research and hired Gr}q)hon Scientific of 
Takoma Park, Maryland, to conduct an 
analysis. NSABB will discuss the review 
on 28 September, and Casadevall wrote, “I 
hope that the NSABB takes notice of this 
paper as an example of the potential value 
of these experiments.” 

Carrie Wolinetz, a policy official at 
the U.S. National Institutes of Health in 
Bethesda, Maryland, says NSABB does not 
plan to recommend any changes to the 
moratorium until next spring. ■ 



A better vaccine virus 

Flu vaccine viruses contain a “backbone” of six 
genes that remain the same (grey) and two variable 
genes encoding surface proteins (red and blue). 
Researchers improved virus growth by making 
mutations to the backbone. 
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GLOBAl 

PAYER 

As head of the $29 
billion Wellcome Trust, 
Jeremy Farrar took 
the stage during the 
Ebola epidemic. Now, 
he wants to make the 
trust a world leader 

By Kai Kupferschmidt 




S ome appointments seem prescient 
in retrospect. On 1 October 2013, 
Jeremy Farrar took over the reins 
of the Wellcome Trust. Three 
months later, an 18-month-old boy 
in the tiny village of Meliandou 
in Guinea developed a fever and 
started vomiting. He died 2 days 
later, the first victim in the largest 
Ebola outbreak the world has ever seen. 

As the epidemic grew into an inter- 
national crisis, the topics that had domi- 
nated Farrar’s career— pandemic prepared- 
ness, international cooperation, fighting 
infectious diseases in poor countries, clini- 
cal studies in the midst of an emergency— 
became global priorities. And as the brand- 
new head of the world’s second-largest 
private funder of medical research, Farrar 
became determined to make a difference. 

Even before the scale of the disaster had 
sunk in, the Wellcome Trust started provid- 
ing millions of dollars to conduct clinical 
trials. Farrar chastised the world for not 
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acting more decisively, lobbied for mak- 
ing experimental therapies available, and 
joined high-level panels to guide the R&D 
response to the drama. 

Farrar had previously worked far from the 
media spotlight, heading a Wellcome Trust 
research group in Ho Chi Minh City, Vietnam, 
for 18 years. Now, almost overnight, his name 
became widely known. Fortune magazine 
named him among the “50 greatest leaders 
of the world,” ahead of people like Facebook 
founder Mark Zuckerberg. “The trust had a 
lot of influence before, of course, in the U.K.,” 
says James Wilsdon, a science policy special- 
ist at the University of Sussex in the United 
Kingdom. “But in the Ebola outbreak they re- 
ally stepped onto the international stage and 
became much more vocal and visible.” 

The research push paid off. On a recent 
Wednesday morning, Farrar stood wearing 
jeans and a blue and white shirt in front of 
several hundred staff members of the Well- 
come Trust assembled on the sixth floor of 
the trust’s modern office in central London. 



“I appreciate that all of you go to bed with a 
copy of The Lancet every Thursday,” he said. 
“But if you don’t, make sure you do so this 
week.” The journal was about to publish the 
results of an Ebola vaccine trial funded by 
the trust, Farrar revealed, and they were 
“very good, probably even better than that.” 
Indeed, the paper made headlines 
around the world. The vaccine, produced by 
Merck & Co., had been 100% effective in a 
trial set up by the World Health Organiza- 
tion (WHO) in Guinea. It was the first time 
ever that an Ebola vaccine had been shown 
to protect humans, raising hopes that future 
outbreaks can be contained much faster— | 
and the trust’s role had been essential. “If I 
we hadn’t been supported by Jeremy and | 
the Wellcome Trust, we would not have been 2 
able to do anything apart from convening t 
a meeting,” says Marie-Paule Kieny, WHO’s 2 
assistant director-general for health systems | 
and innovation. § 

Still, Farrar was not happy. A few days af- | 
ter the trial results came out, he concluded I 
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in The Guardian that the world had failed 
the people in West Africa. “The vaccine 
could and should have begun testing much 
more quickly,” he wrote. Governments, com- 
panies, and philanthropies need to form a 
global fund to develop a series of vaccines 
for testing in the next deadly epidemic, 
he argued. “Ebola should change how the 
world reacts to such outbreaks,” Farrar says. 

The Wellcome Trust, whose assets top 
£18 billion ($29.2 billion), can help bring 
about that change, Farrar says. He also 
plans to make the trust more international 
and dynamic; give younger scientists a leg 
up; coordinate more closely with another 
public health behemoth, the Bill & Melinda 
Gates Foundation; and focus on areas 
where Wellcome can have a 
major impact. “It’s not just 
about giving money,” Farrar 
says. “It’s shifting the Well- 
come Trust from being just 
a funder into being a sort of 
thought leader.” 

THE WELLCOME TRUST is 

the legacy of an eccentric 
American-British business- 
man with a nose for mar- 
keting. Henry Wellcome 
was born in 1853 in a log 
cabin in Wisconsin, then 
part of the Wild West. At 
age 16 he marketed his first 
product, “Wellcome’s magic 
ink,” almost certainly lemon 
juice. Wellcome studied 
pharmacy in Chicago, Il- 
linois, and Philadelphia, 

Pennsylvania, and moved 
to London in 1880, where he and a for- 
mer classmate, Silas Burroughs, founded a 
pharmaceutical business. 

At the time, most drugs in the United 
Kingdom were produced locally by phar- 
macists, using pestle and mortar, but Bur- 
roughs Wellcome and Company started im- 
porting an American novelty: compressed 
pills, which were safer because they con- 
tained a standardized dose. The pills be- 
came an instant success; later, the com- 
pany started making them itself. Wellcome 
coined the trademarked name “tabloids,” 
which turned out to be a marketing boon. 
As a publicity stunt, Wellcome gave some of 
the empire’s big explorers medicine chests 
stocked with tabloids— Robert Scott took 
one with him on his ill-fated expedition to 
the South Pole. (Only later was the term 
tabloid adopted by newspapers filled with 
condensed, easily digestible stories.) 

Wellcome’s private life was less success- 
ful. In 1901, he married Syrie Barnardo, a 
friend’s daughter, who was less than half his 
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age. There were allegations that he beat her, 
and the couple separated and divorced after 
Barnardo had a child with celebrated author 
William Somerset Maugham. Wellcome be- 
came more reclusive, concentrating on his 
company and his collection of medical and 
historical artifacts. When he died in 1936 at 
age 82, it included hundreds of thousands of 
pieces, including Peruvian mummies, Japa- 
nese sex aids, and Napoleon’s toothbrush. 

He had also built a worldwide business, 
which his will converted into a founda- 
tion whose shares were owned exclusively 
by a new trust. The Wellcome Trust was 
charged with “the advancement of medical 
and scientific research to improve man- 
kind’s wellbeing.” 



AFTER HIS STAFF MEETING, Farrar settles 
into a rhythm of half-hour and one-hour 
meetings in his large corner office, a glass 
box on the fourth floor. His day includes 
a strategy meeting on how to convince 
the UK. government to save science from 
spending cuts, a phone call with WHO’s 
Kieny to discuss the Lancet paper, an inter- 
view for the World Economic Forum’s an- 
nual risk report, and a call with a scientist 
at the US. National Institutes of Health. 
The next day, Farrar is scheduled to fly 
to Seattle, Washington, to meet with Bill 
Gates; the week after he has a meeting at 
the White House. 

It wasn’t always obvious that the trust 
would grow into a global force. Wellcome’s 
company had contracted before his death, 
and the trust initially didn’t have much to 
invest. A 1956 report showed that it had do- 
nated about £1 million in the first 20 years. 
But in the following decades, Wellcome sci- 
entists developed important drugs, such as 
the leukemia compound 6-mercaptopurine 



and azidothjmiidine, the first antiviral to 
treat HIV infections. The company grew 
richer and so did the trust. 

In 1986, the trust started selling off 
Wellcome shares to become less depen- 
dent on just one company. Many of the re- 
maining shares were sold to rival Glaxo in 
1995, enabling that company to take over 
Wellcome. Some of the newfound wealth 
was wiped out in the stock market crash of 
2000, but investments in Facebook, Twit- 
ter, and other companies more than made 
up for the losses. “We went from an organi- 
zation that was spending maybe a million 
pounds a month to about a million pounds 
a day,” says Clare Matterson, the trust’s di- 
rector of strategy. 

On average, the trust’s 
coffers have swelled by 
14% annually since 1986. 
But some of the gains 
are controversial. “With 
increased visibility has 
come increased scrutiny,” 
Wilsdon says. “The trust is 
certainly not used to that.” 
The Guardian launched a 
campaign last year urging 
the trust to pull its money 
from oil giants BP and 
Shell. Farrar has argued 
that money buys influence 
and that it’s better to pro- 
mote change as a share- 
holder than to divest. 

The trust is planning to 
spend roughly $1.3 billion 
on medical research in 
2016, more than the UK. 
government’s Medical Re- 
search Council and more than any other 
charity in the world except the Gates Foun- 
dation. Currently, some 75% of its money is 
spent in the United Kingdom, but Farrar 
plans to fund more science abroad. “Jeremy 
is an internationalist,” Matterson says. 

That’s hardly a surprise considering 
Farrar’s upbringing. His father, whom he 
calls a “rabid socialist” and “a huge influ- 
ence,” was an expatriate who taught Eng- 
lish in Singapore, where Farrar was born, 
and later in Libya, New Zealand, and Cy- 
prus. Early in life, Farrar dreamed of work- 
ing at the foreign office. His wife is Aus- 
trian and their three children were born 
in Vienna. “Delighted & proud that I & my 
whole family are part of the 8 million for- 
eign born ‘migrant’ population,” Farrar re- 
cently tweeted when immigration became 
a major topic in the United Kingdom. 

In the competitive domain of world-class 
research and billion-dollar funding agen- 
cies, it’s hard to lead without making en- 
emies. But Farrar seems to have few. Soft- 
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spoken, he exudes an easy confidence and 
listens attentively, frequently sitting on the 
edge of his chair, chin propped on his hand, 
when a colleague talks. He decides quickly, 
but does not micromanage. “He just under- 
stands people very well,” says Nick White, 
a tropical medicine researcher at Mahidol 
University in Bangkok. 

FARRAR REJECTED the top job at Well- 
come when headhunters first approached 
him in 2012 after then-director Mark 
Walport announced his departure to 
become the United Kingdom’s chief 
science adviser. “The next morning 
I woke up feeling really good about 
that decision,” he recalls. When the 
trust failed to find a suitable can- 
didate and asked him again, Farrar 
said yes: “It was genuinely a sense 
of this very important organization 
that had supported me for a long 
time being without leadership.” His decision 
marked the end of a long sojourn in Asia. 

Farrar had trained as a neurologist in 
London, Edinburgh, and Oxford. At a con- 
ference one day, he decided he could not 
imagine spending the rest of his life in the 
company of the neurologists assembled 
there. “Neurologists have to be very precise 
in medicine and that is not me, actually,” 
Farrar says. “Fm not good at that.” He ap- 
plied to a position, spotted by a friend, as 
head of a University of Oxford research 
group set up by the Wellcome Trust at a 
hospital in Ho Chi Minh City. “He is ex- 



tremely bright and very motivated,” says 
White, who had started the unit and helped 
pick Farrar. “Those things outweighed any 
lack of tropical medicine or infectious dis- 
ease experience.” 

Farrar built up the unit and did research 
on dengue fever, typhoid, and other tropi- 
cal diseases. He’s most proud of a study in 
patients whose tuberculosis infections had 
spread to their brain, a condition that is fa- 
tal in half of all cases. In a large trial, Farrar 
showed that steroids could dramatically in- 



crease survival. But international health isn’t 
just a matter of good science. White says; it’s 
also about understanding people, cultures, 
and organizations. “Jeremy is very good on 
the big picture, he is extremely broad in his 
thinking and imaginative too,” he says. 

In Vietnam, Farrar learned how fright- 
ening an emerging disease can be. In 
January 2004, on the eve of Te t, the Viet- 
namese new year, Farrar received a phone 
call from his colleague Tran Tinh Hien. A 
young girl in another hospital had come 
down with flulike symptoms, but Hien had 
a hunch that something unusual was go- 



ing on. Could Farrar run some tests? Far- 
rar called his wife, Christiane Dolecek, also 
a tropical medicine researcher, to run the 
diagnostics— “because I am useless in the 
lab”— and Hien brought samples on the 
back of his motorbike. In the early hours of 
the morning, it became clear that they were 
dealing with H5N1, a deadly avian influenza 
strain. “That was a very scary moment,” 
Farrar says. 

The fate of his friend Carlo Urbani 
was fresh in Farrar’s mind. The Italian 
parasitologist had been working for 
WHO in Hanoi a year earlier when 
he was called to the bedside of an 
American businessman with flulike 
S 3 miptoms. Urbani realized he was 
probably looking at a new and dan- 
gerous disease, and was one of the 
first to raise the alarm. Severe acute 
respiratory syndrome, or SARS, as 
it became known, is now believed 
to have jumped from civets to humans; it 
soon spread to 37 countries and killed more 
than 800 people. Urbani, married with 
three children, persuaded the Vietnamese 
government to quarantine the hospital— “a 
brave decision,” Farrar says. He became in- 
fected and died. 

Now, Farrar was facing a little-known 
emerging pathogen himself. H5N1 had 
caused an enormous outbreak in Hong 
Kong poultry in 1997 that also sickened 
18 people and killed six. “Is this virus trans- 
missible between people? We didn’t know 
that,” Farrar says. The hospital lacked the 



“We need to be bolder in the stuff we 
fund. If we don’t use our independence, 
what’s the point of having it?” 

Jeremy Farrar 



Sharing the wealth 

The Wellcome Trust’s charitable expenditure has increased sharply since the 1980s. 



£ 775 , 000,000 




dies: Wellcome 
Trust established. 
It holds the entire 
share capital of 
his company, 
Wellcome 
Foundation. 



successful 
drugs boost 
foundation’s 
revenues. 



dationgoes to 
the stock market 
as Wellcome pic. 
The trust sells 
25% of its shares 
for £211 million. 



another £2.3 bil- 
lion in Wellcome 
pic shares. The 
Welcome Trust 
Limited takes 
over as the only 
trustee. 



opens with trust 
as main funder: 
becomes largest 
contributor to the 
Human Genome 
Project. 



ing shares in 
Wellcome pic 
sold to Glaxo. 



SmithKIine 
shares sold for 
almost £1.8 
billion. 



becomes 
director: 
trust’s assets 
reach £16.3 
billion ($26.5 
billion). 



1156 11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 



sciencemag.org SCIENCE 



Published by AAAS 



CREDITS: (GRAPH) M. ATAROD; (DATA SOURCE) WELLCOME TRUST 




Farrar led a research unit at a hospital in Ho Chi Minh City, Vietnam, for 18 years. “Of course, I miss Asia,” he says. 



kind of protective equipment that Western 
hospitals have. “Should you go home in the 
evening to your family? Should you stay in 
the hospital?” 

In the following months, Farrar’s team 
treated many patients and made the best of 
the scientific opportunity. They described 
the disease and its various S5miptoms, 
analyzed which factors predicted survival, 
and showed that oseltamivir helped some 
patients— although some H5N1 strains were 
resistant to it. “The research element was 
done extremely well,” White says. “That got 
him and the unit noticed.” 

THE EBOLA OUTBREAK wasn’t detected 
until March 2014, 3 months after it started. 
By mid- July, more than 1000 people had 
become sick and hundreds had died; cases 
were doubling every 3 weeks or so. It re- 
minded Farrar not of SARS or H5N1, but of 
something far worse: the start of the HIV 
pandemic in the 1980s. As a medical stu- 
dent at St. Stephen’s Hospital in London, 
he had tended many early AIDS patients. 
^ “I still remember this now, the power of 
§ things you have no capacity to treat.” 

^ WHO and many public health experts 
y initially said the world knew how to deal 
^ with Ebola outbreaks: by isolating pa- 
I tients, burying the dead safely and quickly, 
g and monitoring patients’ contacts. Farrar 
I wanted to do more. On 5 August, 3 days be- 
§ fore WHO declared Ebola a Public Health 
I Emergency of International Concern, he 
I published a Wall Street Journal op-ed with 
£ global health veterans David Heymann 



and Peter Piot calling for experimental 
therapies to be made available for test- 
ing and compassionate use. The Wellcome 
Trust announced a multi-million-dollar 
package to fund clinical Ebola research 
2 weeks later. 

The trust ended up funding the Guinean 
vaccine trial and studies of two experimen- 
tal drugs, brincidofovir and TKM-Ebola. 
Both of those trials were stopped early be- 
cause of disappointing results— not what 
scientists had hoped for, WHO’s Kieny says, 
“but they managed to eliminate two prod- 
ucts that were highly regarded.” 

“The world changed after SARS because 
it demonstrated in a very powerful way that 
these viruses could go between animals 
and humans quite easily and then spread 
through the world,” Farrar says. Ebola 
has changed the world again, he says: “It 
showed that you can actually do research in 
these situations.” 

ONE OF FARRAR’S DEAREST possessions 
is an old, worn anthology of poems, a gift 
from his father that sits on his desk. On one 
of the first pages, his mother has scribbled a 
quote from Robert Browning, the Victorian 
poet: “Ah, but a man’s reach should exceed 
his grasp. Or what’s a heaven for?” It could 
be Farrar’s slogan at Wellcome. He wants to 
use the trust’s $29 billion capital to reach 
further. “We need to be bolder in the stuff 
we fund,” he says. “If we don’t use our in- 
dependence, what’s the point of having it?” 

In his first year at the trust, Farrar moved 
to double the number of grants for early- 



and mid-career researchers. “Most of the 
great changes that have happened over the 
last 50 years have been led by people in 
their 20s and 30s, and yet I think on the 
whole science has moved away from that 
population,” he says. Farrar also wants to 
focus the trust’s efforts on areas where it 
can have a major impact. One example is 
understanding what makes pathogens like 
Ebola so good at jumping from animals to 
people and becoming established in the hu- 
man population, he says. 

Farrar has strengthened his ties to the 
Gates Foundation; the two organizations 
now have meetings every 6 months to coor- 
dinate and identify areas where they could 
work together. The two traditionally have 
had different models: “We fund projects, 
Wellcome funds people,” says Chris Wilson 
of the Gates Foundation. Today, they more 
often adopt each other’s approach. “We are 
influencing each other,” Wilson says. 

Farrar has also used his independence to 
express his own opinions. In a Wall Street 
Journal piece earlier this year, he criticized 
WHO, which he said had been “held back by 
deep structural flaws and a chronic refusal 
to lead.” “If WHO doesn’t change now, it will 
never change,” Farrar says. 

He’s well aware that the Ebola crisis has 
given Wellcome and himself more clout, and 
he seems comfortable using it. Yet there are 
moments when he regrets leaving his job 
at the hospital and his life in Ho Chi Minh 
City. “I certainly miss being a clinician, and I 
miss being a clinical researcher,” Farrar says. 
“And, of course, I miss Asia.” ■ 
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Tailpipe to tank 

Researchers are vying to use renewable energy to suck 
carbon dioxide out of the air and turn it back into fuel 



By Robert F. Service 



S tuart Licht has designed the ul- 
timate recycling machine. The 
solar reactor that he and col- 
leagues built in his lab at George 
Washington University in Wash- 
ington, D.C., takes carbon dioxide 
(CO^) from the atmosphere— a 
byproduct of fossil fuel combus- 
tion— and uses the energy in sun- 
light to turn it back into fuel. There are a 
few steps in between. Water is also involved 
in the reaction, which produces hydrogen 
(H^) and carbon monoxide (CO); they in 
turn can be stitched into liquid hydrocar- 
bon fuels. But Licht’s is one of the most ef- 
ficient devices of its type ever constructed. 



It is only one of the solar fuel technologies 
taking shape in labs around the world. They 
embody a dream: the prospect of one day 
bypassing fossil fuels and generating our 
transportation fuels from sunlight, air, and 
water— and in the process ridding the atmo- 
sphere of some of the CO^ that our fossil fuel 
addiction has dumped into it. 

These schemes are no threat to the oil 
industry yet. In Licht’s device, parts of the 
reactor run at temperatures approaching 
1000°C, high enough to require specialized 
materials to hold the components. Other 
researchers are pursuing an alternative 
approach, developing catalysts that could 
carry out the same chemical reactions at 



or near room temperature, using electricity 
from sunlight or other renewables to power 
the chemical knitting process. 

The bigger hurdle is economic. Oil is 
cheap, for the moment, and there is little 
incentive to adopt cutting-edge, costly alter- 
natives. But the relentless march of climate 
change, and the elegance of the concept, 
have drawn researchers around the globe to „ 
the pursuit of solar fuels. “This is a very hot § 
area right now,” says Omar Yaghi, a chemist ^ 
at the University of California (UC), Berke- § 
ley. And as Licht’s reactor demonstrates, J 
the research is making progress. “We’re not | 
there yet, but we’re moving in the right di- | 
rection,” says Andrew Bocarsly, a chemist ^ 
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at Princeton University who is developing 
low-temperature catalysts. 

Enthusiasts even see a glimmer of hope 
for making the technology economical: 
the steady spread of renewable electric- 
ity sources, such as wind farms and solar 
plants. Already, windmills and solar cells 
sometimes generate more power than lo- 
cals can absorb. If this oversupply could 
be stored in chemical fuels, experts argue, 
utility providers might be able to save their 
power for use anytime and an 3 rwhere— and 
make extra money on the side. 

THE NEED FOR LIQUID FUELS is unlikely 
to go away despite concerns about climate 
change. The high energy density and ease 
of transport of gasoline and other liquid 
hydrocarbons have made them the main- 
stay of the world’s transportation infra- 
structure. Researchers continue to pursue 
the use of low-carbon gases, such as meth- 
ane and hydrogen, as transportation fuels, 
and electric cars are proliferating. But for 
long-distance trucks and other heavy ve- 
hicles, as well as aviation, there is no good 
alternative to liquid fuels. Solar fuel pro- 
ponents argue that finding a way to brew 
them from readily available compounds 
such as water and CO^ could make a sizable 
dent in future CO^ emissions. 

The task essentially boils down to run- 
ning combustion in reverse, injecting en- 
ergy from the sun or other renewables 
into chemical bonds. “It’s a very challeng- 
ing problem, because it’s always an uphill 
battle,” says John Keith, a chemist at the 
University of Pittsburgh in Pennsylvania. 
It’s what plants do, of course, to make the 
sugars they need to grow. But plants con- 
vert only about 1% of the energy that hits 
them into chemical energy. To power our 
industrial society, researchers need to do 
far better. Keith likens the challenge to put- 
ting a man on the moon. 

The trouble is that CO^ is a very stable, 
unreactive molecule. Chemists can force 
it to react by pumping in electricity, heat, 
or both. The first step in this process is 
usually ripping off one of CO^’s oxygen 
atoms to make CO. That CO can then be 
combined with to make a combination 
known as syngas, which can be converted 
into methanol, a liquid alcohol that can 
i be either used directly or converted into 
^ other valuable chemicals and fuels. Massive 
m chemical plants do just that, but they make 
I their S 5 mgas not from air, but from plenti- 
5 ful and cheap natural gas. So the challenge 
a for chemists is to create syngas from renew- 
I ables more cheaply than current sources 
< can match. 

g Licht, who calls his solar-generated 
I mixture of CO and “sungas,” says he’s 



taking aim at that challenge by using 
both heat and electricity from the sun. 
His setup, which he details in a paper ac- 
cepted at Advanced Science, starts with 
a high-end commercially available solar 
cell called a concentrated photovoltaic. It 
focuses a broad swath of sunlight onto a 
semiconductor panel that converts 38% of 
the incoming energy into electricity at a 
high voltage. The electricity is shunted to 
electrodes in two electrochemical cells: one 
that splits water molecules and another 



that splits CO^. Meanwhile, much of the 
remaining energy in the sunlight is cap- 
tured as heat and used to preheat the two 
cells to hundreds of degrees, a step that 
lowers the amount of electricity needed to 
split water and CO^ molecules by roughly 
25%. In the end, Licht says, as much as 50% 
of the incoming solar energy can be con- 
verted into chemical bonds. 

It’s unclear whether that process will 
produce syngas that’s as cheap as that 
made from natural gas. But Licht notes 
that a 2010 economic analysis of his solar 
water splitting setup alone, which he first 
described in 2002, concluded that his ap- 
proach could generate a kilogram of the 

energy equivalent of 4 liters of gasoline— at 
a cost of $2.61. 

Yet it may be hard for Licht’s sungas 
setup to lower the price further. Licht’s 
charge-conducting electrolyte uses lithium, 
a somewhat rare and costly metal whose 
limited supplies could prevent a massive 
scale-up. Licht also faces competition from 
other researchers who also use high tem- 
peratures to ease the splitting of water and 
CO^, but rely entirely on electricity instead 
of solar heating. But like sungas, those 
schemes, called solid oxide electrolysis 
cells, face the longevity challenges of run- 
ning at high temperatures. 



GIVEN THESE HURDLES, Bocarsly and 
others continue to try to split CO^ at lower 
temperatures. One such approach is already 
commercial. In Iceland, a company called 
Carbon Recycling International opened a 
plant in 2012 that uses renewable energy to 
create syngas. The company harnesses the 
island’s abundant geothermal energy to pro- 
duce electricity, which drives electrolysis ma- 
chines that split CO^ and water. The resulting 
syngas is then turned into methanol. 

Of course, most regions of the globe 



lack Iceland’s abundant geothermal power 
needed to drive the process, so research- 
ers are hunting for new catalysts that can 
split CO^ with less energy. These catalysts 
typically sit on the cathode, one of two 
electrodes in an electrolytic cell containing 
water. At the opposite electrode, water mol- 
ecules are split into electrons, protons, and 
oxygen, which bubbles away. The electrons 
and protons pass to the cathode, where CO^ 
molecules split into CO and oxygen atoms 
that combine with the electrons and pro- 
tons to make more water. 

Today, the gold standard for such cata- 
lysts is, well, gold. In the 1980s, Japanese 
researchers found that electrodes made 
from gold had the highest activity for split- 
ting CO 2 to CO of all the low-temperature 
setups. Then in 2012, Matthew Kanan, 
a chemist at Stanford University in Palo 
Alto, California, and colleagues discovered 
something even better: Making their elec- 
trode from a thin layer of gold divided into 
nanosized crystallites, they reported in the 
Journal of the American Chemical Society, 
slashed the electricity needed by more than 
50% and increased the catalyst’s activity 
10-fold. The boundaries between the gold 
crystallites appear to promote the reaction. 

At some $36,000 per kilogram, gold is 
still far too expensive for use on a mas- 



Powered by geothermal energy, this plant in Iceland turns carbon dioxide into syngas and ultimately methanol fuel. 
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Conjuring chemical cornucopias out of thin air By Robert F. Service 



A fill-up at the gas station may seem expensive, but fuels are 
relatively cheap commodities. So would-be makers of 
solar fuels are looking for ways to apply their technology 
to making more valuable materials. 

Last month, for example, Stuart Licht, a chemist 
at George Washington University in 
Washington, D.C., and colleagues reported 
in Nano Letters that they had developed a 
version of their solar reactor technology 
(see main story, p. 1158) that can take CO^ 
out of the air and convert it into solid 
carbon nanofibers. The researchers found 
that when they added trace amounts of 
either nickel, copper, cobalt, or iron to 
their electrolysis cell, the metals form tiny 
islands on the cathode that then serve 
as landing sites for thousands of split-off 
carbon atoms to insert themselves and 
quickly grow into long, thin fibers. Licht’s 
team had previously shown they could 
turn CO 2 into a solid carbon material 
called amorphous carbon, which is worth some $1000 per 
metric ton. But carbon nanofibers could be worth far more, 
perhaps as much as $25,000 per metric ton, because they are 
widely used in making lightweight, high-strength composites 
for applications such as car bumpers and airplane parts. “We 



are making a valuable commodity that we hope will produce a 
driving force for using this technology,” Licht says. 

Others are pursuing the same strategy. A small New 
Jersey-based company called Liquid Light is working to 
commercialize technology for converting CO^ into ethylene 
glycol, a commodity chemical with a 
$27 billion annual market. Another 
company, Skyonic, recently opened a 
demonstration plant in Texas that turns 
CO 2 into baking soda, hydrochloric acid, 
and bleach. 

Such commodities aren’t manufactured 
on a scale anywhere near that of gasoline 
and other transportation fuels. So making 
them with CO^ siphoned off a smoke- 
stack or pulled from the air isn’t likely to 
make a sizable impact on global atmo- 
spheric CO 2 levels, says Matthew Kanan, 
a chemist at Stanford University in Palo 
Alto, California, who is working to con- 
vert CO 2 into plastics and other higher 
value commodities. However, he says, “Perhaps we can use 
that as a stepping stone.” With time and experience, compa- 
nies may then find way to improve their processes, lower their 
costs, and begin to make high-volume, low-cost compounds 
such as fuels. “I’m a technology optimist,” Kanan says. ■ 




sive scale. Last year, however, researchers 
led by Feng Jiao, a chemist at the Univer- 
sity of Delaware (UD), Norwalk, reported 
in Nature Communications that catalysts 
made from silver nanoparticles do almost 
as well. And this year, they reported in ACS 
Catalysis that even cheaper catalysts made 
from tiny zinc spikes called dendrites are 
also proving highly effective at churning 
out CO. 

Catalysts that could be even cheaper are 
in the works. Researchers at UC Berkeley, 
for example, reported last month that they 
had made a highly porous crystalline mate- 
rial out of organic ring-shaped compounds 
with a combination of cobalt and copper 
atoms at their core. When layered atop 
an electrode and dunked in a water-based 
solution, the porous materials split CO^ 
molecules into CO at a rate of 240,000 per 
hour— a furious pace compared with most 
other room-temperature catalysts. And last 
year, Kanan and his colleagues reported 
that electrodes made of nanocrystalline 
copper could bypass the need for syn- 
gas, allowing them to directly synthesize 
a variety of more complex liquid fuels, 
such as ethanol and acetate, at unprec- 
edented efficiencies. 

Researchers worldwide are also pur- 
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suing another rich vein: driving the 

low-temperature electrolysis of CO^ and H^O 
with energy directly from sunlight. Most 
efforts center on using light-absorbing 
semiconductors, such as titanium dioxide- 
based nanotubes, to churn out CO, meth- 
ane, or other hydrocarbons. So far, such 
setups aren’t very efficient; typically they 
convert less than 1% of the incoming solar 
energy into chemical bonds. Bocarsly and 
others have done better using the sun’s ul- 
traviolet light, which makes up only a tiny 
part of the spectrum. But at the American 
Chemical Society meeting in Boston last 
month, Joel Rosenthal, a chemist at UD 
Newark, reported that his team has devel- 
oped a bismuth-based photocatalyst that 
converts 6.1% of incoming visible light en- 
ergy to chemical bonds in CO. 

Despite progress on all these fronts, 
Kanan cautions that solar fuels still have 
a long way to go to compete directly with 
liquid fossil fuels, especially now that the 
price of oil has fallen below $50 per barrel. 
And barring a concerted push from gov- 
ernments worldwide to cap or tax carbon 
emissions, solar fuels may never be able 
beat oil-derived fuels on cost alone. “It’s a 
tall order,” he says. 

But Paul Kenis, a solar fuels researcher at 



the University of Illinois, Urbana-Champaign, 
argues that the broad penetration of solar 
and wind power offers hope. Denmark, for 
example, already produces some 30% of its 
electricity from wind farms and is on pace to 
reach 50% by 2020. On a particularly blus- 
tery day in July, the nation’s wind turbines 
generated as much as 140% of the country’s 
electrical requirements. The excess was sent 
to its neighbors, Germany, Norway, and Swe- 
den. But the oversupply added to utilities’ 
fears that in times of peak renewable power 
production, the value of electricity could fall 
to zero or even below, as producers would 
have to pay others to take it so as not to 
damage their grid. 

That’s where solar fuel producers could 
stand to benefit, Kenis says: By absorbing 
that power and using it to make fuels and 
other commodities, they could essentially 
act as energy banks and perhaps earn some ^ 
cash as well. For now, Kanan argues, it still ° 
makes the most economic sense simply to | 
shunt excess renewable power into the | 
grid, displacing fossil energy. But someday, | 
if renewable power becomes widespread | 
enough and the technology for making re- ° 
newable fuels improves, we may be able to | 
guzzle gas without guilt, knowing we are g 
just burning sunlight. ■ I 
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Megathrust aftermath. Tsunami produced by the 2011 magnitude 9 Tohoku subduction earthquake. 



GEOPHYSICS 

Weak subduction makes great quakes 

Small earthquakes reveal low stress levels at megathrust zones and in surrounding crust 



By Roland Biirgmann 

T he world’s greatest earthquakes, 
producing catastrophic shaking and 
tsunamis, occur in subduction zones. 
Here oceanic plates dive below ad- 
joining regions along megathrust 
faults (see the figure). The recent 
magnitude ~9 megathrust earthquakes in 
Sumatra, Chile, and Japan, with fault dis- 
placements of several tens of meters, were 
stark reminders of the destructive power 
of these events. On page 1213 of this is- 
sue, Hardebeck (i) uses the orientations 
of fault planes of thousands of smaller 
earthquakes near and above the world’s 
megathrusts to evaluate the state of stress 
driving these great events. The general 
conclusion made is that all faults in sub- 
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duction zones, including the megathrusts, 
are unusually weak. 

Plate tectonic forces drive the subduction 
process and earthquakes. Stress at a par- 
ticular point can be fully described by the 
orientations and magnitudes of three mu- 
tually perpendicular, principal (maximum, 
intermediate, and minimum) compressive 
stresses. By definition, shear stress is zero 
on planes perpendicular to the principal 
stresses. As Earth’s free surface cannot sup- 
port shear stress, the principal stresses in 
the shallow crust are expected to be parallel 
and perpendicular to the surface, a state of 
stress referred to as Andersonian (2). 

The magnitude of stresses in the brittle 
crust is bounded by the frictional strength 
of faults. The coefficient of friction deter- 
mines the strength of a fault as a linear 



function of the fault-perpendicular normal 
stress. Laboratory-derived estimates of fric- 
tion coefficients for most rock types suggest 
that fault strength and stress will linearly 
increase with increasing rock overburden 
within Earth, reaching several hundred 
megapascals (MPa) at depths where subduc- 
tion earthquakes occur (3). 

Our knowledge of the actual stresses 
driving earthquake faulting remains lim- 
ited and is based on indirect or incomplete 
measures of stress. It is nearly impossible “ 
to directly measure absolute stress where | 
earthquakes occur, as this requires observa- h 
tions in costly deep boreholes ( 4 ). A number | 
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of geophysical studies in some subduction 
zones have suggested that fault friction val- 
ues and stress may be much lower than ex- 
pected from laboratory experiments. Such 
studies have relied on estimates of frictional 
heat generated by faulting (5, 6), the orien- 
tation of stresses estimated from the geom- 
etry of aftershock fault planes (7), as well 
as the apparent large rotation in principal 
stress orientations from before to after the 
2011 Tohoku earthquake in Japan (8, 9). 

Hardebeck focuses on using the fault 
plane geometry of small earthquakes near 
and above the megathrust of global subduc- 
tion zones to determine the orientations of 
the principal stresses and constrain their 
magnitudes. The orientation of the maxi- 
mum principal stress with respect to the 
megathrust provides information about 
the relative strength of the fault and sur- 
rounding crust. If the megathrust is weak 
(low friction) in an otherwise strong crust, 



we would expect the principal stresses to 
be nearly perpendicular and aligned with 
the fault, similar to the case for the free 
Earth’s surface. If both the megathrust and 
surrounding material are either weak or 
strong, we would expect the maximum com- 
pressive stress to be oriented at an oblique 
angle (20° to 60°) from the fault. 

Contrary to expectations, Hardebeck finds 
that the orientation of maximum compres- 
sive stress near most subduction thrusts 
plunges toward the trench at angles of 10° 
to 50°. This indicates that the state of stress 
is not Andersonian and that the maximum 
compressive stress is oriented roughly 30° 
from the megathrust. This orientation is 
expected for faults with frictional strength 
similar to that of the surrounding crust; 
that is, they are either both strong or both 
weak. Using evidence of absolute megathrust 
weakness from studies documenting low 
frictional heat production (5, 6), stress ro- 
tations due to great earthquakes (8, 9), and 
high fluid pressures in fault zones (10), Hard- 
ebeck argues that both the megathrust and 



surrounding crust must be weak, with only a 
few exceptions. Stresses are likely no higher 
than a few tens of MPa, rather than hun- 
dreds of MPa expected from standard fric- 
tion theory. Apparently, not that much stress 
is required to make a great earthquake fault 
slip, and when it does, it drops the ambient 
stress to extremely low levels. 

Hardebeck suggests that most faults in 
the world’s subduction zones are weak. 
What might produce such weakness? Pos- 
sible explanations include intrinsically low 
friction of fault zone materials composed of 
clays and talc (11), high fluid pressures that 
push against the normal stress on faults 
(10), and dynamic shear weakening during 
earthquake fault slip (12). It is important 
to remember that the inference of stress 
is indirect and depends on the accuracy 
of the fault plane orientations and various 
assumptions made when inferring stress 
orientations from the fault slip data. To im- 
prove our knowledge of the mag- 
nitude and orientation of stress in 
subduction zones and the factors 
determining their variations in 
space and time, we need much- 
improved observations. More 
detailed observations of small 
earthquakes and their fault plane 
orientations near the megathrust 
require seafloor seismic sensors. 
Seafloor geodetic measurements 
can capture the strain associated 
with evolving stress (13). Improved 
in situ observations are possible 
by drilling into a megathrust at 
greater depths to directly observe 
the composition, stress, and condi- 
tions in the fault zone (14). The distribu- 
tion of absolute stress in space and time 
is probably the most important quantity 
in solid earth sciences that we know the 
least about. The analysis reported in Hard- 
ebeck’s paper takes an important step to- 
ward changing that. ■ 
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EVOLUTION 

How single 
cells work 
together 

Are single-celled 
symbioses organelle 
evolution in action? 

By Jonathan P. Zehr 

S ymbiotic interactions are fundamen- 
tal to life on Earth and were critical 
for the evolution of organelles that 
led to the success of eukaryotes on 
the planet. Such mutualistic inter- 
actions between unicellular micro- 
organisms and multicellular plants and 
animals are pervasive in natural and ag- 
ricultural ecosystems (1). In contrast, very 
little is known about symbiotic interactions 
between unicellular partners. Recent stud- 
ies have revealed single-celled nitrogen- 
fixing symbioses that require different 
mechanisms to maintain symbiosis than 
seen in multicellular systems. 

In aquatic environments, there are many 
examples of protists that acquire endo- 
symbionts or plastids (2). These types of 
symbioses are analogous to the evolution 
of organelles, such as the photosynthetic 
organelle of the unicellular eukaryote Pau- 
linella (3). Other unicellular mutualistic 
associations are nitrogen-fixing symbioses— 
for example, between filamentous cyano- 
bacteria and marine diatoms (see the figure, 
panel A) (4). The filamentous cyanobacteria 
in these associations develop specialized 
cells (heterocysts) that allow them to sepa- 
rate the oxygen generated by photos 5 mthesis 
from the oxygen-sensitive nitrogen fixation 
enz 3 mies. In these S 3 mibioses, the intracel- 
lular cyanobacteria coordinate growth and 
division and are transmitted through differ- 
ent life stages (5). More recent studies have 
found evidence for nitrogen-fixing symbio- 
ses between unicellular cyanobacteria and 
single-celled protists that do not involve the 
development of specialized cells. 

One example is a nitrogen-fixing sym- 
biosis between the single-celled cyanobac- 
terium UCYN-A (unicellular cyanobacteria 
nitrogen-fixing “group A”) and a prymne- 
siophyte microalga (see the figure, panel B) 
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Weak subduction zones. The orientation of compressive stress 
inferred from the geometry of many small earthquake fault planes 
suggests a weak megathrust in a weak surrounding crust. 
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(6, 7). This symbiosis is one of a few docu- 
mented S5mibioses with a prymnesiophyte 
(a group of protists) and the only known ni- 
trogen-fixing symbiosis with a haptophyte. 
Furthermore, the nitrogen-fixing cyanobac- 
terial partner exhibits dramatic genomic re- 
ductions that have led to a very high degree 
of metabolic streamlining (8). A second 



example is a similar symbiosis between a 
freshwater diatom and a nitrogen-fixing cy- 
anobacterium (simply called spheroid bod- 
ies; see the figure, panel C) with a small and 
metabolically streamlined genome (9, 10). 
Both of these cyanobacterial genomes are 
greatly reduced compared to those of other 
cyanobacteria. Although UCYN-A retains a 
complete photosystem I, it does not evolve 
oxygen and may instead allow for energy or 
reductant production. The spheroid bod- 
ies have completely lost both photosystem 
I and photosystem II. Both genomes have 
also lost part or all of the TCA (tricarboxylic 
acid) cycle, and have lost the ability to fix 
carbon dioxide into organic matter. They 
thus lack the defining characteristics of 
cyanobacteria and are only cyanobacteria 
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in the sense that they evolved within the 
cyanobacteria lineage. 

These single-celled sjmibioses are in- 
triguing models of nitrogen-fixing S5mibio- 
sis in general, but are perhaps even more 
interesting as models of the evolutionary 
events that led to eukaryotic organelles. 
The genome reduction implies that the 
cyanobacterium must be obli- 
gate and implies an evolution- 
ary “luggage” strategy of the 
host {11), because it must always 
carry the symbiont even if is not 
physiologically advantageous at 
all times. 

It is curious that these sin- 
gle-celled symbioses did not 
ultimately give rise to a nitrogen- 
fixing organelle, given the impor- 
tance of nitrogen as a nutrient 
that limits the productivity of ter- 
restrial and aquatic ecosystems. 
The unicellular cyanobacterial 
sjmibiotic partners are as close 
to being such nitroplasts as any- 
thing yet known. Indeed, some 
features of the UCYN-A reduced 
genome are very reminiscent of 
the features of algal chloroplast 
genomes: For example, it exists 
in the cell in two different forms 
that are caused by rearrange- 
ments of the chromosome brack- 
eted by inverted sequence repeat 
regions. Perhaps UCYN-A and 
similar microorganisms {4, 9) ef- 
fectively serve as such nitrogen- 
fixing organelles. 

The nature of single-celled 
S3mibiosis is important for un- 
derstanding how organelles 
evolved and what the minimum 
constraints for symbiosis are. 
S 3 mibiotic interactions with mul- 
ticellular plants or animals have 
additional needs beyond the 
metabolic exchanges that form the core of 
nutritional, or syntrophic, mutualistic in- 
teractions. Signaling, mobility, cell and tis- 
sue development, entrance through tissue 
or cell walls, tissue modifications, or struc- 
ture development are likely all unnecessary 
for a symbiosis that involves just two cells, 
or a small number of cells. By contrast, 
transmission of the symbiotic partner to fu- 
ture generations of host cells maybe critical 
in the dilute ocean environment where con- 
centrations of cells are low, and growth and 
cell division must be highly coordinated to 
ensure that the growth of one partner does 
not outstrip that of the other. 

In single-celled S 5 mibioses, both partners’ 
growth and cell division must be coordi- 
nated. The UCYN-A prymnesiophyte host 




Examples of unicellular symbioses. (A) Marine diatom with 
heterocyst-forming filamentous cyanobacterium. (B) Fluorescent 
in situ hybridization micrograph of UCYN-A (red) and its symbiotic 
partner, the prymnesiophyte Braarudosphaera bigelowii (green). Blue, 
DAPI (4 ',6-diamidino-2-phenylindole)-stained nucleus. Two types of 
this cyanobacterium are now known: UCYN-Al (left) and UCYN-A2 
(right). (C) Rhopaloid freshwater diatom showing spheroid body. 



was first visualized with a single cyanobac- 
terial cell (6). A few observations have now 
been made of two UCYN-A cells on one 
host (12), suggesting that the cyanobacterial 
growth and division occur prior to the prym- 
nesiophyte cell division. UCYN-A’s close rela- 
tive UCYN-A2 (13) forms a cluster of a few 
cells encased in a cellular pocket of the host 
prymnesiophyte. It is unclear how this group 
of cells grows and divides in synchrony with 
the attached host prymnesiophyte cell. 

It is not certain whether the UCYN-A 
cyanobacteria are enclosed in a membrane. 
One observation of UCYN-A (UCYN-A2) 
suggests that the symbionts are intracellu- 
lar (14). Regardless of its position inside or 
outside the cell, there must be channels or 
porters that allow exchange of fixed nitro- 
gen for carbon, and perhaps even proteins, 
as in the photosynthetic organelle of Pauli- 
nella (3). There are now many international 
reports of UCYN-A, showing that the sym- 
biosis is present at much wider geographic 
scales than is typical for the better-known 
oceanic nitrogen-fixing microorganisms 
(12, 15). Many similar symbioses probably 
exist in the environment, but are difficult to 
detect or study because they are small, not 
easily visualized or characterized, and diffi- 
cult to manipulate experimentally. UCYN-A 
and the freshwater diatom symbionts offer 
a snapshot of what is likely to be found and 
provide motivation for looking harder in 
the environment for examples of microbial 
symbiosis and, perhaps, organellar evolu- 
tion in action. ■ 
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CLIMATE 

An increasing carbon sink? 

Southern Ocean carbon uptake may have strengthened 
between 2002 and 2012, slowing climate change 



By S. E. Mikaloff-Fletcher 

S ince 1870, Earth’s oceans have ab- 
sorbed more than one-quarter of the 
carbon dioxide emitted to the atmo- 
sphere from fossil fuel burning and 
other human activities, thereby dra- 
matically slowing climate change (i). 
The Southern Ocean is responsible for -40% 
of this global ocean carbon sink {2). Recent 
studies have suggested that the 
rate of carbon uptake by the South- 
ern Ocean may be slowing {3, 4). 

Such a positive climate feedback 
effect would reduce the Southern 
Ocean’s capacity to slow climate 
change. On page 1221 of this issue, 
Landschiitzer et at. show that al- 
though the rate of carbon uptake 
by the Southern Ocean slowed be- 
tween the 1980s and early 2000s, 
it began to strengthen again in 
2002 and continued to do so until 
at least 2012 (5). 

Air-sea fluxes of CO 2 are propor- 
tional to the difference in partial 
pressure of CO 2 between the at- 
mosphere and the surface ocean, 

APCO 2 {6). Therefore, increases in 
atmospheric CO 2 concentrations 
are expected to drive increases 
in ocean CO 2 uptake. In 2007, Le 
Quere et at. suggested that South- 
ern Ocean carbon uptake began 
leveling off in the early 1980s 
while atmospheric CO 2 continued 
to rise (see the figure) (5). The au- 
thors based their conclusions on an analysis 
of ocean models and an atmospheric inverse 
model that estimates air-sea and air-land 
CO 2 fluxes from atmospheric CO 2 measure- 
ments and atmospheric model simulations. 
They and others attributed this trend to the 
strengthening and poleward shift of the west- 
erly winds that overlay the Southern Ocean, 
bringing carbon-rich waters to the surface 
and thus reducing (3, 4). Because the 

^ changes in the westerly winds are linked 
I to climate change and ozone depletion, the 
I rate of carbon uptake by the Southern Ocean 
I would then be expected to continue to slow 
^ in response to future climate change (3). 
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The hypothesized weakening of the 
Southern Ocean carbon sink has been the 
subject of vigorous debate, because many 
processes that play a critical role in the 
Southern Ocean are not well represented 
in coarse-resolution ocean models (7, 8). 
Furthermore, estimates of trends in the 
Southern Ocean carbon sink from atmo- 
spheric inversion models are sensitive to 
data selection (9). 



Landschiitzer et al have now analy7:ed 
trends in a global data set of ApCO^ measure- 
ments that includes 2.6 million observations 
in the Southern Ocean spanning 30 years 
{10). They used two novel statistical methods 
to interpolate ship data in space and time: a 
neural network approach that incorporates 
physical and biological observations, and a 
method that assimilates the ApCO^ data into 
a mass budget of the mixed layer. As in the 
earlier study of Le Quere et al. (3), they also 
compared their results with complementary 
estimates from atmospheric data. The analy- 
sis confirms that the rate of carbon uptake 
by the Southern Ocean slowed down between 
the early 1980s and the early 2000s. How- 
ever, all three approaches reveal a striking 
reversal in this trend in 2002. From 2002 to 



2012, Southern Ocean carbon uptake began 
to strengthen, a result that would not have 
been apparent during the time period stud- 
ied by Le Quere et al. (see the figure). 

Further support for this finding comes 
from a coincident study by Munro et al. 
based on carbon measurements collected in 
the Drake Passage between 2002 and 2015 
{11). Although focused on a single region, this 
study has a remarkable temporal coverage 
of more than 20 cruises per year. The results 
show a clear increase in ApCO.^, Drake 
Passage south of the Antarctic Polar Front 
from 2002 to 2015, implying a strengthening 
of the carbon sink in this region. 

The reinvigoration of the ocean carbon 
sink demonstrated by the two studies (5, 11) 
presents a puzzle. Between 2002 and 2012, 
there has been no reversal in the strength- 
ening of westerly winds thought 
to have caused the slowdown 
between the early 1980s and the 
early 2000s (5). Landschiitzer et 
al. argue that the reinvigoration 
of the sink since 2002 may result 
from a combination of regional 
trends in sea surface temperature 
(which controls CO 2 solubility) and 
circulation-driven changes in the 
carbon balance at the sea surface. 

Although these studies represent 
an intriguing new picture of the 
Southern Ocean sink in the recent 
past, it is not yet clear how this re- 
gion will respond to future changes 
in climate. The Southern Ocean 
has long been undersampled, par- 
ticularly during the austral winter, 
which may be a critical period for 
the reinvigoration of the carbon 
sink {11). An improved understand- 
ing of key processes controlling 
this important carbon sink will 
require more observations with 
higher spatial and temporal resolu- 
tion of carbon and other physical 
and biogeochemical properties in the South- 
ern Ocean and the overlying atmosphere. ■ 

REFERENCES AND NOTES 

1. C. Le Quere etal., Earth Syst. Sci. Data 7, 47 (2015). 

2. I L. Frolicher ef a/., J. Clim. 28, 862 (2015). 

3. C. Le Quere etal., Science 316 , 1735 (2007). 

4. N. S. Lovenduski, N. Gruber, S. C. Doney, Global 
Biogeochem. Cycles 22, GB3016 (2008). 

5. P. Landschutzerefa/.,Sc/ence349,1221(2015). 

6. R. Wanninkhof, J. Geophys. Res. 97,7373 (1992). 

7. T. Ito, M. Woloszyn, M. Mazloff, Nature 463, 80 (2010). 

8. K. B. Rodgers etal.. Biogeosciences 11 , 4077 (2014). 

9. R.M.Lawefa/., Sc/ence 319 , 570a (2008). 

10. D. C. E. Bakkeref a/.. Earth Syst. Sci. Data 6 , 69 (2014). 

11. D. R. Munro ef a/., Geophys. Res. Lett. 
10.1002/2015GL065194(2015). 

ACKNOWLEDGMENTS 

Supported by NIWA core funding through the Greenhouse 
Gases, Emissions and Carbon Cycle Science Programme. 

10.1126/science.aad0912 

11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 1165 




Beyond the slowdown. Southern Ocean CO 2 uptake anomalies from 
Le Quere et al. (3) (red) and Landschutzer et al. (5) (blue), averaged between 
35°S and 90°S. Both studies found a flattening of the Southern Ocean carbon 
sink between the 1980s and early 2000s despite rising atmospheric CO 2 
concentrations. On the basis of longer-term data, Landschutzer et al. report a 
reinvigoration of this sink in the mid-2000s. Positive values reflect ocean uptake: 
anomalies are calculated relative to the 1980s average. 
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IMMUNOLOGY 

Finises carry antiviral cargo 

Infected cells generate a factor that is incorporated into 
viruses and transferred to other cells 



By John W. Schoggins 

I n the molecular arms race between vi- 
ruses and their host cells, each side em- 
ploys multiple strategies to deal with the 
other. Whereas the host has sophisticated 
antiviral signaling programs to combat 
viral infection, viruses use their own pro- 
teins to subvert these host defenses. Viruses 
are often lauded for these clever evasion tac- 
tics. However, the host may also have its own 
brand of molecular chicanery. On pages 1232 
and 1228 of this issue, Gentili et at. {!) and 
Bridgeman et at. (2), respectively, show that 
during infection, a host cell-derived antiviral 
molecule is packaged inside viral particles. 
As a stowaway, the antiviral factor is poised 
to trigger immune defense pathways upon 
infection of another host cell. 

During a viral infection, the host cell uses 
specialized proteins to sense viral nucleic ac- 
ids. For viruses with double-stranded DNA 
genomes or DNA-based intermediates (e.g., 
retroviruses), a major sensor is the cytosolic 
cyclic guanosine monophosphate (GMP)- 
adenosine monophosphate (AMP) S 5 mthase 
(cGAS) (3, 4), a DNA binding protein that 
catalyzes the formation of the heterodinu- 



cleotide cyclic GMP-AMP (cGAMP) in re- 
sponse to DNA (5). Signaling by cGAMP, the 
founding member of a family of metazoan 
cyclic heterodinucleotides, parallels analo- 
gous second messenger functions of cyclic 
dinucleotides in bacteria (6). In mammalian 
cells, cGAMP binds to the protein stimulator 
of interferon genes (STING), which triggers 
the production of interferons, cytokines that 
establish an antiviral state. 

The importance of the cGAS-cGAMP path- 
way has been demonstrated clearly in vivo. 
Mice that lack cGAS are highly susceptible 
to multiple DNA viruses (7, 8). Moreover, 
cGAMP function does not appear to be lim- 
ited just to the infected cell, as it can passively 
diffuse across cellular membranes to trigger 
antiviral signaling in neighboring cells (9). 
Gentili et al. and Bridgeman et al. find that 
cGAMP can also transmit antiviral signals by 
moving as cargo inside newly formed viruses. 

To demonstrate this novel mode of cGAMP 
transmission, both studies used HFV-l-based 
lentiviral vectors in cell culture models of in- 
fection. When the viruses were produced in 
cells overexpressing cGAS, the authors could 
detect transmission of an antiviral signal into 
newly infected cells. Furthermore, STING ex- 



pression was required for the transmitted 
signal to stimulate interferon production in 
the newly infected cells. Gentili et al. and 
Bridgeman et al. also confirmed, using mass 
spectrometry, that cGAMP was indeed inco- 
porated into viral particles. 

Gentili et al. and Bridgeman et al. ex- 
tended their studies to other enveloped 
viruses and cell types. For example, HIV-1- 
based vectors bearing envelope proteins 
from diverse viruses, including vesicular sto- 
matitis virus, influenza A virus, and thogo- 
tovirus, could transmit the cGAMP signal 
through this stowaway mechanism. HIV-2 - 
based vectors, specific HIV-1 strains, and 
murine leukemia virus, did too. Importantly, 
two enveloped viruses with double-stranded 
DNA genomes— vaccinia virus and murine 
cytomegalovirus— also incorporated cGAMP 
into the viral particles. Both studies further 
show that endogenous concentrations of 
cGAS were sufficient for cGAMP inclusion 
in viral particles. Also, primary dendritic 
cells showed enhanced activation (reflected 
as the production of cytokines) by lentiviral 
vectors when cGAMP was preloaded into the 
viral particles. Together, these findings sug- 
gest that this mechanism of cGAMP cell-to- 
cell transport may exist in nature and may 
be harnessed for therapeutic applications 
such as vaccination. 

But is cGAMP loaded into viral particles 
in a natural setting? HIV-1 normally evades 
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Stowaway. Viral DNA generated during a viral infection is sensed by cGAS, which produces the cGAMP dinucleotide. cGAMP is incorporated as cargo into newly formed virions that exit the | 

cell and infect nearby cells. cGAMP is released into the cytosol of the newly infected cell, binds STING, and triggers downstream antiviral signaling. IRF3, interferon regulatory factor 3. d 
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Plant precursor. Epotoside is a 
commonly used chemotherapy 
drug that is derived from a natural 
compound in mayapple plants. 



PLANT BIOCHEMISTRY 

Fightii^ cancer while 
saving the mayapple 

The genes required for synthesizing a plant-derived 
anticancer compound are identified 



cGAS-mediated detection, suggesting that 
the viral particles that emerge from an in- 
fected cell would likely not incorporate 
cGAMP (10). However, Gentili et al. and 
Bridgeman et al. show that a poxvirus and 
a herpesvirus (both double-stranded DNA 
viruses) can take up cGAMP into viral par- 
ticles. Perhaps the stowaway mechanism is 
restricted to certain classes of DNA viruses 
or retroviruses that lack sophisticated eva- 
sion tactics. 

Another question is whether cGAMP in- 
corporation into virions is a host-directed 
strategy or simply a consequence of stochas- 
tic fluid-phase uptake of cytosolic material 
into viral particles. By necessity, enveloped 
viruses must use host membranes to com- 
plete the virus assembly and maturation 
process. Viruses incorporate a variety of 
host molecules into their virions such as 
histocompatibility proteins, tetraspanins, 
or even antiviral molecules like apolipo- 
protein B messenger RNA editing enzyme, 
catalytic polypeptide-like 3G (APOBEC3G) 
in the case of HIV-1 {11). Yet, it is not clear 
if this process is regulated by the host. It 
is tantalizing to speculate that the host is 
as wily an adversary as the virus, with the 
ability to sneak an antiviral molecule into 
outbound viral particles. 

What are the biological implications of 
cGAMP incorporation into viral particles? 
The host might benefit on multiple levels. 
A virus carrying cGAMP may promote long- 
distance transmission of antiviral signals to 
cells and tissues located far from the initial 
site of infection. Similarly, a cGAMP-loaded 
virion could serve as a sentinel of innate im- 
mune signaling during transmission of the 
virus from one host to another, potentially 
even crossing species barriers. The main 
benefit to the host could be rapid initia- 
tion of cGAMP-mediated signaling, even in 
cells that cannot mount antiviral responses 
due to poor cGAS expression. It is not yet 
clear, however, whether cGAMP loading is 
a propagating mechanism. Further studies 
will be needed to determine whether cells 
activated into an antiviral state by incoming 
virus carrying cGAMP also produce virions 
that carry cGAMP. ■ 
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By Sarah E. O’Connor 

P lants synthesize an abundance of 
metabolites that can be exploited for 
pharmacological purposes {!). The 
pool of plant metabolites that can be 
considered medicinally important 
is greatly expanded when consider- 
ing that many plant natural products can 
be used as a scaffold for derivatization, with 
the resulting unnatural analogs often having 
either improved or novel medicinal activity. 
Typically unnatural analogs are made semi- 
symthetically by chemically modifying natu- 
ral biosynthetic intermediates. However, on 
page 1224 of this issue, Lau and Sattely (2) 
report the discovery of a set of biosynthetic 
enzymes in mayapple {Podophyllum) plants 
that can produce a compound that is a di- 
rect precursor to etoposide, an “unnatural” 
anticancer agent. Moreover, Lau and Sattely 
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show that the genes encoding these enzymes 
can be expressed in a different plant species 
to produce this etoposide precursor. The 
study clearly demonstrates the power of met- 
abolic pathway discovery and genetic engi- 
neering to make not only naturally occurring 
compounds, but also natural product analogs 
with enhanced pharmacological value. 

Derivatizing unnatural analogs from 
natural plant compounds has had success- 
ful outcomes in the development of anti- 
cancer drugs. For example, camptothecin, 
which is produced by the Chinese “happy 
tree” {Camptotheca), is a potent inhibitor 
of topoisomerase, an enzyme that controls 
DNA integrity during the cell division cycle. 
However, camptothecin is too insoluble for 
clinical use. Its derivatization with an amino 
group yields topotecan, an anticancer agent 
currently used in the clinic to treat ovarian, 
cervical, and small-cell lung cancer {3). Even 
when natural compounds are clinically im- 
portant, chemical derivatization can broaden 
the spectrum of applications or mitigate side 
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effects. Derivatives of the plant-derived anti- 
cancer agent taxol [isolated from the Pacific 
yew tree {Taocm brevifoliay] and vinblastine 
[isolated from the Madagascar periwinkle 
{Catharanthus roseusy\ are used in the clinic 
in addition to the natural compounds (4, 5). 

Etoposide is chemically synthesized from 
the natural metabolite podophyllotoxin, a 
liganin that is produced in mayapple plants 
(d). The potent cytotoxic activity of podophyl- 
lotoxin suggested that it could be developed 
into an anticancer drug, but in practice, the 
compound proved to be too toxic for clinical 
use. In the 1950s, a program was launched 
to synthesize and screen semisynthetic podo- 
phyllotoxin derivatives that exhibited less 
toxic side effects. Etoposide, which was even- 
tually discovered from this effort, showed 
excellent results in clinical trials and was 
introduced to the United States drug market 
in 1983 (d). Etoposide has a different mode 
of cytotoxicity compared to the podophyllo- 
toxin precursor. Chemical derivatization is 
typically used to enhance pharmacological 
properties such as water solubility, but eto- 
poside reflects how subtle chemical modifi- 
cations can also dramatically change the 
biological mode of action. 



^...biosynthetic enzymes 
in mayapple...produce a... 
precursor to...an ‘unnatural’ 
anticancer agent.” 

Medicinally useful plant metabolites, in- 
cluding podophyllotoxin, are usually iso- 
lated from the producer plant and are often 
produced in low yields. Furthermore, many 
medicinal plants are difficult to grow or are 
endangered, as is the case for a number of 
Podophyllum species (7). Development of a 
renewable plant cell culture that generates 
sufficient yields of podophyllotoxin has been 
unsuccessful (8). An emerging approach for 
sustainable production of plant-derived com- 
pounds is reconstitution of the biosynthetic 
genes in a heterologous host to recreate a 
functional pathway. Many challenges must be 
overcome to reconstitute these complex bio- 
synthesis pathways, but perhaps the largest 
hurdle is finding the relevant genes within a 
large, uncharacterized plant genome. 

Lau and Sattely identified missing steps in 
podophyllotoxin biosynthesis by tracking the 
expression levels of mayapple genes. Because 
podophyllotoxin increases in response to leaf 
wounding, comparison of RNA sequence 
data sets from wounded and unwounded 
leaf tissue pinpointed appropriately upregu- 
lated genes. Lau and Sattely filtered the pool 
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of gene candidates by considering only up- 
regulated genes that encode four enzyme 
classes that are predicted to carry out spe- 
cific types of chemical reactions required 
for podophyllotoxin biosynthesis. Once this 
set of candidates was identified, the genes 
could be transformed into the heterologous 
host Nicotiana henthamiana, a fast-growing 
relative of tobacco for which a robust pro- 
tein expression system has been developed 
(9). Lau and Sattely expressed multiple gene 
candidates at once in this host plant and 
identified the resulting compounds in leaf 
tissue by mass spectrometry. This untargeted 
metabolite profiling approach, in which all 
compounds resulting from candidate gene 
expression are identified, proved critical for 
characterizing unexpected, but highly valu- 
able, chemical reactions that are catalyzed 
by these candidate gene products. Expres- 
sion of one candidate gene consumed the 
biosynthetic intermediate (-)-deoxypodo- 
phyllotoxin, but the expected hydroxylated 
product was not detected, and instead, a 
demethylated product was observed. A sec- 
ond candidate gene did hydroxylate (-)-de- 
oxypodophyllotoxin in the predicted position 
but 5 delded unexpected stereochemistry. Lau 
and Sattely realized that these unpredicted 
enzymatic reactions could be used to pro- 
duce a more direct precursor for etoposide. 
Expression of these two genes, plus an addi- 
tional eight biosynthetic genes and a small- 
molecule precursor, in the N. henthamiana 
host produced (-)-4'-desmethylepipodophyl- 
lotoxin, a compound that can be converted 
to etoposide in fewer chemical steps than are 
required for the podophyllotoxin precursor. 

Although discovery of biosynthetic genes 
from plant metabolic pathways remains a 
challenging prospect, the elucidation of 
this pathway by Lau and Sattely demon- 
strates how far the field has come. The use 
of a heterologous host as a gene discovery 
tool ensures that gene candidates can be 
examined in the context of other pathway 
enzymes, which in this case was critical for 
revealing unexpected chemistry carried out 
by biosynthetic enzyme candidates. Rather 
than produce podophyllotoxin, Lau and 
Sattely used a specific combination of en- 
zymes to synthesize a molecule more valu- 
able than the initial target. ■ 
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QUANTUM MECHANICS 

Interference of 
atomic clocks 

The time dilation of gravity 
is mimicked with atomic 
clocks in magnetic fields 

By Markus Arndt and Christian Brand 

W hen de Broglie predicted that we 
need to associate a periodic phe- 
nomenon with any isolated portion 
of matter or energy (7), this idea 
became the basis of Schrodinger’s 
wave equation and modern matter- 
wave interferometry. It has stood the test of 
time and inspired intriguing discussions on 
the relation between quantum physics, classi- 
cality {2), and general relativity theory (GRT) 
{3-5). It also inspired the recent work on page 
1205 of this issue by Margalit et al. (6), who 
demonstrated that the internal clock of a de- 
localized atom can be used as a witness of the 
atom’s path through a matter-wave interfer- 
ometer. The study shows Bohr’s complemen- 
tarity principle in action and how dephasing 
in an external potential may mimic “classi- 
cality” even though the underlying quantum 
correlations can be erased and reversed. 

In 1908, Einstein predicted that the proper 
time of any clock on Earth will change with 
the local gravitational potential by one part 
in 10^® per meter height difference. This ef- 
fect is tiny, but it already needed to account 
for the comparison of optical clocks with a 
height separation of 5 cm (7). If we take de 
Broglie’s idea of a periodic phenomenon in 
any lump of matter literally, does that imply 
that even “a rock is a clock?” Could this boost 
precision measurements of gravitational time 
dilation in an atom interferometer where the 
center-of-mass wave function of an atom is 
delocalized over a certain height (3)? 

Although it has been argued that time 
dilation will remain unobservable in experi- 
ments with simple “rocks” as probe particles 
(4), interesting effects are expected (5) when 
the rock is replaced by a clock, i.e., a particle 
with internal dynamics (see the figure, panel 
A). Even if the two center-of-mass wave func- 
tions of the same atom are perfectly in phase 
and suitable for high-contrast matter-wave 
interference, the internal atomic clocks may 
still tick at different rates in each arm. 
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Because the total atomic wave function is 
the product of both the internal and the cen- 
ter-of-mass state, the overall interference will 
vanish if the internal states along two inter- 
ferometer branches evolve to be orthogonal. 
In agreement with established decoherence 
theory (8), the internal clock gets entangled 
with the position of the atom and acts as a 
witness of the path the atom takes in the field. 
This mathematically expresses Bohr’s comple- 
mentarity principle, that an experiment can 
reveal high-contrast interference or unambig- 
uous which-path information, but not both. 
Dephasing by gravitational time dilation is 
only expected to be maximally destructive 
if an atomic clock of frequency co can be de- 
localized over a vertical separation H in the 
gravitational acceleration g over time T, with 
(o-H-T > TTC^/g — 3 X 10^® m (5), where c is the 
speed of light and g = 9.81 m/s^ is the Earth’s 
gravitational acceleration. Although modem 
atom interferometers already achieve spec- 
tacular coherence times and beam separa- 
tions (9), the predicted effect is still too small 
to be measurable today. 

This challenge inspired Margalit et al. to 
build an experimental analogy that can il- 
lustrate the underlying complementarity 
principle, as well as the role of a clock as a 
^ which-path witness in atom interferometry 
I (see the figure, panel B). They start from a 
I small Bose-Einstein condensate (BEC) of 
I mbidium-87 atoms. A sequence of radio- 
^ frequency (RE) and magnetic-field gradient 
I pulses allows the ensemble to split coher- 
I ently into two. The naturally expanding ul- 
^ tracold atomic clouds then expand further 
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Relativity and complementarity. (A) The original proposal for testing an atomic clock as a “which path” witness uses 
gravitational time dilation in atom interferometry (5). (B) An analog experiment on atom interference with an internal 
clock (6), where each atomic branch is exposed to a different magnetic field rather than a different gravitational 
potential. The evolution of time is mimicked by the evolution of a clock’s phase. Starting from a single Bose-Einstein 
condensate, a field-gradient beam splitter generates two spatially separated clouds of rubidium-87 atoms at zero 
relative velocity, which both expand and overlap to interfere without the need of any further mirror or beam splitter. An 
RF pulse creates a coherent superposition of two Zeeman levels in the same hyperfine state. Their relative phase is the 
hand of the clock. Its orientation varies-it changes at different heights -with a magnetic gradient pulse of duration T^. 
(C) High matter-wave interference contrast is observed when the clocks are synchronized. The fringes vanish when the 
hand of the same clock on the same atom finally points in opposite directions in the two arms. 



to finally overlap and interfere. Because the 
atomic locations within these two clouds are 
fundamentally indistinguishable, the atomic 
matter-wave amplitudes can interfere to cre- 
ate a characteristic set of fringes in the atom 
number density. 

This indistinguishability and interference 
may, however, be broken if path information 
is available in the internal atomic state. For 
that to happen, Margalit et al. used an RE 
pulse to start the clock by creating an iden- 
tical superposition of the sublevels mp = 1 
and TTZp = 2 of the same hyperfine state F =2 
in both arms. Their clock (d) ticks about 10 
orders of magnitude more slowly than mod- 
ern optical clocks (7) and cannot test GRT. 
However, its low frequency provides excel- 
lent control over the internal state, which is 
important here to visualize the essential role 
of the path witness. 

Both branches see the same proper time, 
but to simulate aspects of the envisioned fu- 
ture tests of GRT, a vertical magnetic-field 
gradient provides different magnetic fields 
at different heights to drive the magnetic 
clocks out of phase. When the clock hands 
point in opposite directions, matter-wave 
interference vanishes. A longer interaction 
time in the external magnetic field allows 
the clocks to rephase. The path informa- 
tion is erased, the witness eliminated, and 
matter-wave interference is restored (see the 
figure, panel C). 

Complementing earlier demonstrations in 
neutron {10) and atom interferometry {11), 
the experiment of Margalit et al. shows the 
periodic and reversible dephasing of mat- 



ter waves that occurs when an atomic clock 
is used for which-path labeling. It could be 
argued that the underlying importance of 
quantum indistinguishability can even be 
visualized in a Mach-Zehnder interferometer 
with light— if the atomic clock in panel A of 
the figure was replaced by polarized light 
and the polarization was rotated in one arm 
with the magnetic field in a Faraday rota- 
tor. However, a superposition of a massive 
clock in each interferometer arm has a much 
farther-reaching potential, in principle. In 
the long run, it may test the role of time as 
a universal parameter in quantum mechan- 
ics, in contrast to GRT, where it is a dynami- 
cal variable. Furthermore, also the original 
proposal for matter-wave interference as a 
witness for time dilation in the Earth’s po- 
tential (5) may be tested. For that to happen, 
experiments will have to be pushed by many 
orders of magnitude over the current state- 
of-the-art in coherence time, path splitting, 
and clock frequency by combining the best 
optical clocks (running at 3 x 10^^ s"0 with 
beam separations and coherence times on 
the order of 1 m and 10 s. ■ 
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CLIMATE CHANGE 

Winning coalitions for climate policy 

Green industrial policy builds support for carbon regulation 



By Jonas Meckling/’^* Nina Kelsey, Eric 
Biber/’^ John Zysman®’^’^ 

T he gap is wide between the impli- 
cations of climate science and the 
achievements of climate policy. Nat- 
ural sciences tell us with increasing 
certainty that climate change is real, 
dangerous, and solvable; social sci- 
ences report that key constituencies largely 
support action. But current and planned 
policy remains weak and will 
POLICY allow a long-term increase in 
temperature of 3.6°C (i). How 
can we address the gap between science 
and policy? From the political successes 
of climate policy leaders, we identify key 
strategies for building winning coalitions 
for decarbonization of domestic economies. 
Green industrial policy provides direct in- 
centives for growth of green in- 
dustries, which builds political 
support for carbon regulation. 

Policy-makers and scholars 
increasingly think that global 
climate agreements emerge 
from aggregating bottom-up, 
domestically driven policies 
rather than from top-down 
negotiations { 2 ). But research 
does not explain what drives 
bottom-up approaches. How do 
we create and maintain political 
and entrepreneurial will for the 
fundamental transformations 
in our economy, infrastructure, 
and institutions needed to de- 
carbonize our energy systems? 

Empirical research on actual 
decarbonization strategies gives 
an answer: Providing economic 
benefits supports effective policy-making in 
a way that penalizing industrial polluters 
does not { 3 - 6 ). Green industrial policy cre- 
ates and enhances low-carbon industries, 
which brings economic constituencies into 
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coalitions for decarbonization, as well as 
giving feedback that drives progress toward 
more comprehensive climate policy. 

CARBON PRICING, MARGINAL 
CHANGE. This dynamic forces us to re- 
evaluate assumptions underlying carbon 
emissions policy-making through the lens of 
coalition-building. Economists favor directly 
regulating emissions by putting a price on 
greenhouse gas (GHG) emissions. Pricing 
carbon through a tax or a cap-and-trade 
scheme is, theoretically, the most efficient 
solution. But there are political barriers to 
implementing an effective carbon price. To 
date, one regional, 38 national, and 21 sub- 
national jurisdictions have, are scheduled 
to implement, or are considering a carbon 
price (7). Although half of these schemes are 
operating, realizing all of them would cover 



only 12% of global GHG emissions. 

Progress is slow because carbon regula- 
tion imposes costs on the powerful few— 
well-organized energy and energy-intensive 
manufacturing firms— and provides dis- 
persed benefits to the weak many— the 
broader public ( 8 ). The few regulatory los- 
ers have greater incentives and capacity to 
organize politically and prevent policy im- 
plementation. So policies focused on impos- 
ing a cost on carbon often fail. Even when 
carbon pricing schemes are created, they 
accommodate the demands of polluters ( 9 ), 
which renders them only marginally effec- 
tive. Without political support, carbon pric- 



ing tends to be a weak instrument at best. 

As such, carbon pricing may be ineffec- 
tive for cultivating coalitions for stron- 
ger low-carbon policy. It favors least-cost 
changes, but costly moves, like major new 
capital investments, underlie corporate in- 
terest realignment. Current (weak) pricing 
schemes tend to effect marginal changes, 
such as supplementary equipment and 
fuel switching in existing infrastructure, 
although there is also a modest increase in 
patenting ( 10 ). Strong carbon pricing would 
likely drive more fundamental changes but 
is politically costly, whereas more direct 
measures, such as renewable portfolio stan- 
dards (RPSs), have had more support. 

COALITIONS AND FEEDBACK. Un- 
like carbon pricing, green industrial policies 
have proliferated since the late 1980s. Such 
policies provide concentrated 
benefits to the few and well- 
organized, such as renewable 
energy firms, low-carbon indus- 
tries, and investors. By 2013, at 
least 132 countries and subna- 
tional entities had enacted a 
feed-in tariff (FIT), an RPS, or 
both ( 11 ). 

Economists view such mea- 
sures as third- or fourth-best 
options for efficiency ( 12 ). But 
research in political science 
and law suggests that green 
industrial policy nurtures a po- 
litical landscape of interests and 
coalitions that benefit from a 
transformation to low-carbon 
energy use— even when pollut- 
ing industries might oppose it. 
The widespread adoption of 
FITS and RPSs and the fact that they often 
precede carbon pricing, confirms this (see 
the figure). Green industries are political 
allies in the development of more stringent 
climate policy that subsequently penalizes 
incumbent polluters. Carrots buy sticks. In 
contrast, Canada and the United States both 
failed to create cap-and-trade schemes and 
lacked strong, prior federal renewable en- 
ergy policy. 

Winning coalitions thrive on positive 
feedback ( 13 ). The more green industries 
form or expand, the stronger coalitions for 
decarbonizing energy systems become, and 
the easier it gets to install stronger or more 
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Sequencing in climate policy. Of the 54 countries and subnational entities that 
adopted a carbon pricing scheme by 2013, nearly two-thirds installed a FIT or RPS 
before setting up the pricing regime. Data from (7, 11). 
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*Of these 19, 13 are either late entrants to the ED that “inherited” carbon pricing when they 
joined, or countries that already had naturally low-carbon energy mixes (percentage of 
nuclear -i- hydroelectric power substantially above global share). 
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comprehensive regulatory strategies. Policy 
leaders have used feedback processes before 
implementing carbon pricing. For instance, 
German policies began with funding for re- 
search and development, then subsidies for 
demonstration projects during the 1970s and 
1980s, and continued to larger-scale market 
formation programs, including a feed-in law 
after the 1986 Chernobyl disaster. These 
policies led to industrial expansion in wind 
and, later, solar energy production; these 
developments helped create and expand a 
coalition of interests that fought to defend 
existing measures and supported further 
measures {14). California, too, demonstrates 
feedback effects {15): Early measures re- 
sponding to pollution and oil crises led to 
(i) the creation of a strong regulatory in- 
frastructure; (ii) efficiency regulations; (iii) 
decoupling of profits from sales volume for 
utilities; and (iv) early support for renew- 
ables. Those measures created tolerance 
for regulation and set the stage for the pas- 
sage of a renewable portfolio standard and 
GHG reduction legislation that ultimately 
resulted in an emissions trading scheme. 
Denmark has a similar story {4). 

A political strategy that emphasizes 
green industries may raise concerns that it 
is vulnerable to costs from rent-seeking (ob- 
taining economic gain from others without 
benefitting others) and regulatory capture. 
However, a feedback-based strategy broad- 
ens political support, which can effectively 
lead energy systems out of carbon lock-in. 
And as Rodrik {16) convincingly argues, 
rent provision can be managed to prevent 
capture of policy-makers by winners. More- 
over, it is possible that these “third-best” 
policy options may avoid future costs, by 
speeding up progress toward more ambi- 
tious emission cuts. 

POLICY IMPLICATIONS. Although 
some governments have— often unintention- 
ally-built winning coalitions, policy-makers 
should use the approach more strategically 
to cut emissions across power, transport, 
and heating sectors and as other countries 
adopt their initial climate policies. We pro- 
pose three key strategies: (i) adopt initial 
policy suites of targeted sector-specific poli- 
cies; (ii) send direct, high-leverage policy 
signals rather than broad, shallow ones; and 
(iii) sequence policies strategically. 

First, multiple policies— narrow sector, 
technology, and region-specific— are effec- 
tive for initiating a trend toward decarbon- 
ization because they provide concentrated 
benefits and can link climate policies with 
local issues. Targeted green industrial poli- 
cies-like subsidies, tax rebates, and renew- 
able energy standards— provide concrete 
benefits to firms and households. Specificity 
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makes them politically bounded and rela- 
tively easy to understand— unlike broader, 
more systemic strategies, such as carbon 
pricing or urban planning reform. They can 
be tailored to provide side benefits and to 
balance different demands. Linking issues 
allows for greater leverage in policy-mak- 
ing {4). For example, the European Union’s 
(EU’s) climate policy mix aimed at reducing 
emissions and also played to concerns about 
energy security and national competitive- 
ness. The broad policy suite allowed policy- 



‘‘The more green industries 
form or expand, the stronger 
coalitions for decarbonizing 
energy systems become.” 

makers to link emission cuts to other key 
energy-related policy goals, such as reduc- 
ing dependence on Russian gas and creat- 
ing export opportunities. In early phases of 
policy-industry feedback, when political will 
for climate policy per se is still limited, all 
these qualities are important. 

Second, policy signals need to have high 
leverage, directly tied to concrete, mean- 
ingful changes in industry investment or 
structure. Relative to weak carbon pricing, 
policy instruments like FIT or RPS provide 
comparatively strong direct incentives for 
growth of cohesive green industry groups 
and, thus, are most likely to drive initial 
shifts in investment and revenues that can 
realign interests in industries {17). This re- 
alignment expands coalitions for low-carbon 
policy and provides support for experimen- 
tation with policy and technology, as well 
as progress toward systems transforma- 
tion. Marginal changes encouraged by weak 
economy-wide signals like carbon-pricing 
will not accomplish this goal. 

Third, strategic sequencing of policies 
matters. Initially, climate policy must cre- 
ate constituencies that provide the support 
for subsequent policy moves. Early high- 
leverage measures are particularly likely 
to mobilize support. They also prove to 
be politically stable and lasting given sup- 
port from the constituencies and coalitions 
they create. For instance, several efforts in 
US. states to roll back RPSs have failed in 
recent years, in part owing to political op- 
position from beneficiaries of those policies. 
Over time, broader policy signals targeted at 
polluters, like carbon prices, can be intro- 
duced and strengthened. The more carbon 
policy is politically entrenched, the more 
discretion there is for less-targeted policy 
that is more efficient. Strategic sequencing 



requires adaptive policy design— focused 
on knowledge exchange, discipline, and ac- 
countability (id)— to prevent lock-in of tech- 
nical-institutional paths that fail to increase 
political support and/or to decarbonize the 
energy system (e.g., ethanol policy). 

Future research needs to note the context 
of successful strategies and to specify poten- 
tial policy interventions, particularly as they 
vary by locale. For instance, what type of 
policy sequences work in different types of 
political or energy systems? How can policy- 
makers avoid dead ends and maintain flex- 
ibility for adjusting policy measures? How 
can policy-makers best balance needs for 
politically salient and economically efficient 
policy interventions? When are policy-mak- 
ers likely to retrench from decarbonization 
trajectories? How will opposing “brown” co- 
alitions adapt over time to strategies based 
on these concepts? 

Over the past year, climate change has 
risen on the global agenda. The US. climate 
action plan, the United States-China deal, 
and the EU’s 2040 targets are key develop- 
ments. The December climate meeting in 
Paris may deliver more. The real test for ef- 
fective climate policy will be the extent to 
which governments are capable of building 
and growing domestic coalitions for low- 
carbon energy that support implementation 
and strengthening of those international 
commitments over time. ■ 
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Rethinking heritability of the microbiome 

How should microbiome heritability be measured and interpreted? 



By Edward J. van Opstal^ 
and Seth R. Bordenstein^ ^* 

F or almost a century, heritability has 
been routinely used to predict genetic 
influences on phenotypes such as in- 
telligence, schizophrenia, alcoholism, 
and depression (i). However, there has 
been relatively little work on heritabil- 
ity of the human microbiome— deflned here 
as the number and types of microorganisms 
and viruses present in or on the human body 
This question has become increasingly more 
interesting as research reveals that humans 
and their microbial communities interact 
in complex and often beneficial networks. 
An underlying question is the degree to 
which environment versus human geno- 
type influences the microbiome. A cen- 
tral goal of quantifying microbiome 
heritability is to discern genetic from * 
environmental factors that structure 
the microbiome and to potentially 
identify functionally important mi- 
crobial community members. 

Twin-based studies pro- 
vide one method for 
quantifying heritability 
(/i^) of microbial taxa. In 
such analyses, heritability 
is measured by comparing 
variation in microbial taxon 
abundances that is attribut- 
able to human genetics. This approach sim- 
plifies microbial abundances to continuously 
varying phenotypes, comparable to human 
height, weight, and eye color. In 2009 and 
2012, studies of twins conducted in this man- 
ner concluded that there are no heritable gut 
microbial members (2), or low overall gut 
microbiome heritability (5), respectively. But 
in 2014, the largest twin cohort to date ex- 
amined members of the gut microbiome and 
found that the bacterial family Christensenel- 
laceae has the highest heritability (/^^ = 0.39), 
and associates closely with other heritable 
gut bacterial families (4). The ground- 
breaking discovery of a high heritability for 
members of the human microbiome raises 
speciflc questions about understanding the 
genetics of human-microbe symbioses: How 
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should we interpret what heritability means 
for microbiome studies? Can microbiome 
heritability be viewed in a more comprehen- 
sive manner? Is the only term to measure 
and denote microbiome heritability? 

Under a heritability analysis with standard 
statistical approaches [such as the Additive 
Genetics, Common Environment, Unique En- 
vironment (ACE) model], the abundance of 
each human-associated microbe is presented 
as a continuously varying, quantitative “trait” 
that is affected by host genetics— in other 
words, the host genome signiflcantly dictates 
the abundance of a microbe. Although a suit- 
able starting point, this host-centric inter- 
pretation of microbiome heritability tends 
to consider the human-microbiome inter- 
action as unidirectional, in which the 
host regulates colonization. This view, 
however, is only half of the story. The 
microbiome is a collection of differ- 
ent organisms with genotypes that in- 
teract with each other as well as with 
the host to achieve colonization. A 
more comprehensive view is advis- 
able in which both the host 
and the microbiome play 
a role in heritability. This 
view, based on community 
genetics principles, requires 
that studies adopt a concep- 
tual foundation of interspe- 
cies (genotype-by-genotype) 
interactions that drive the assembly of the 
host and microbial consortia. It also necessi- 
tates the use of a measure— “community heri- 
tabilit}^’ (i7^^)— that reflects genetic variation 
underlying interactions with the entire (or 
portions of the) community— in this case, the 
microbiome together with its human host. 

Human-associated microbes contain dis- 
tinct genetic, transcriptomic, metabolic, and 
proteomic features that can reciprocally in- 
fluence their own colonization of speciflc 
human genotypes. These features span com- 
petitive nutrient acquisition, mechanisms to 
evade the host immune system, and niche 
construction, among others. Thus, heritable 
taxa such as Christensenellaceae may be “re- 
cruited” by the human genome to perform 
beneflcial functions in the microbial com- 
munity, but they also may encode traits that 
enable them to circumvent host defenses and 
colonize susceptible genotypes. For example, 
commensal bacteria tolerate or evade human 
immune responses by modifying surface 
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components in their cell walls and outer 
membranes (5). Other microbes proactively 
subvert the immune system by injecting ef- 
fector proteins into host immune cells to kill 
them (6). Microbes may alter the expression 
of genes involved in regulating immune, de- 
velopmental, and metabolic functions {7-10). 
Communities of microbes can also alter their 
human environment by forming internal 
biofilms. Biofilms can provide protection 
against the host immune response, afford 
stability in fluctuating environments, and 
promote the survival of a microbial commu- 
nity {11). Put simply, microbiome heritability 
can represent three outcomes: host control 
of the microbiome, host susceptibility to the 
microbiome, or a combination of host control 
and susceptibility Interspecies interactions 
(host-microbe and microbe-microbe) thus 
underlie the emergent property of microbi- 
ome heritability 



Standard heritability 




Genetic relatedness 
among twins 



0 ^ 0 ^ 






Microbial species 
abundance 



Community heritability 




Genetic relatedness 
among individuals 




Microbial 

community 

structure 



Analyzing heritability. Under standard heritability 
(/?2) analysis, each microbe is a quantitative “trait" that 
is encoded by host genetics. Comparing monozygotic 
versus dizygotic twins allows an indirect estimate of 
the additive genetic factors affecting that microbe’s 
abundance. Community heritability {H^^) emphasizes 
that the host is part of the ecosystem and measures the 
extent to which whole community phenotype variation 
is due to genetic variation in the host. It specifies that 
host genetic variation will have predictable effects on 
microbial community assembly. 



Human-microbiome interactions tend 
to be viewed through the lens of host regu- 
lation. A key point when discussing com- 
^ munity-wide symbiotic interactions is that 
g heritability denotes host involvement rather 
I than control of microbiome assembly. Thus, 
I human genetic effects on the microbiome are 
^ most easily understood as genotype-by-geno- 
I type (i.e., hologenomic) interactions between 
I the host and other microbial members {12). 
^ Without knowing the mechanisms underly- 



ing the heritability of a certain microbial 
member, we must be cautious in interpreting 
whether a highly heritable microbe in any 
symibiosis is harmful, harmless, or beneficial 
to the whole system. 

In addition to taking a comprehensive view 
of microbiome heritability, it is important to 
understand the ecological genetics principle 
of broad-sense community heritability {H^^) 
{13). IP^ emphasizes that the host is part of 
an ecosystem and measures the extent to 
which variation in “whole-community” phe- 
notype is due to genetic variation in the foun- 
dation (i.e., host) species of the community. 
It therefore specifies that host genetic varia- 
tion will have predictable effects on microbial 
community assembly {14), in addition to hav- 
ing effects on specific members of the micro- 
biome, as measured by 

The whole-community phenotype is mea- 
sured by ordination methods that cluster 
microbial community data into single scores 
used to compare compositional differences 
(i.e., p diversity) between the communities 
of various hosts. One such data clustering 
method is nonmetric multidimensional scal- 
ing (NMDS). Analysis of NMDS scores (by 
ANOVA, which tests whether there is sig- 
nificant variation in means among groups, 
among subgroups, within groups, etc.), would 
then identify the fraction of total variation in 
whole community phenotype that relates to 
genetic variation in the host {14). 

The utility of this approach in addition 
to conventional heritability measurements 
is that it incorporates the vast interspecies 
interactions that contribute to a whole com- 
munity phenotype, instead of considering 
individual microbial members as phenotypic 
extensions of the host. If these interactions 
have a heritable component {H'^^), then the 
assembly of the community is nonrandom 
(i.e., via ecological selection). For host-micro- 
biome symbioses, this has been referred to as 
“phylosymbiosis” (i5). Given a signficant 
selection as an evolutionary force can poten- 
tially act on the community. 

Presently, community heritability mea- 
surements have only been applied to eco- 
logical systems analyzing plant genetic 
influences on soil microbe, arthropod, bird, 
and mammal community structures {14). 
Human microbiome studies could adopt 
to determine whether human genetic 
variation across the whole genome, or cer- 
tain functional categories of genes, affects 
microbial community assembly. Twin-based 
microbiome studies could derive a NMDS 
score for the whole microbial community or 
from specific taxonomic levels to test if there 
is a significant association with human ge- 
netic relatedness across the cohort of twins. 
Further, transcriptomes, metabolomes, and 
proteomes could potentiate identification 



of the candidate genes that affect microbi- 
ome heritability. Data integration from large 
“omics” data sets holds the potential to move 
the community genetics view of host and mi- 
crobes from unspecified genetic effects on 
interacting species to precise gene-by-gene 
interactions. 

The products of host genes work in in- 
timate association with the products of 
microbial genes to enable functioning of 
the whole holobiont {14). Strong degrees 
of microbiome heritability could therefore 
have profound consequences for the ecol- 
ogy and evolution of human and all animal 
and plant holobionts. A crucial outcome of 
this community heritability view is to un- 
derscore the deterministic and predictable 
interactions between hosts and microbes. 
Further studies also need to consider the 
relative roles of vertical and horizontal 
transmission of microbial communities in 
heritability assessments. Thus, genetic anal- 
ysis of whole community organization is an 
important frontier in the life sciences, par- 
ticularly for fusing the fields of ecology and 
evolution and the taxonomic disciplines of 
zoology and botany with microbiology. The 
repurposing of community heritability from 
traditional macroecological systems (i.e., 
gardens) to microecological systems (i.e., 
human gut) will provide a more compre- 
hensive view to studies of microbiome heri- 
tability. This view looks outwards, beyond 
the phenotype, to examine links at higher 
interspecies levels and holds the potential 
to unify community ecology and evolution 
concepts. In the words of the microbiologist 
Carl Woese: “Biologists now need to refor- 
mulate their view of evolution to study it in 
complex dynamic-systems terms.” ■ 
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Symmetry, proportion, 
and economy are 
common features of 
art and of naturally 
occurring structures. 



AESTHETICS 

Beautijul minds 

A physicist and a philosopher contemplate the nature 
of beauty and the value of art 



By Giovanni Frazzetto 

» s living beings with curious, sophis- 
ticated minds, we marvel at nature 
and ponder our relationship with it. 
To make our investigation stimulat- 
ing, we endlessly develop and refine 
methods of discovery. Art, science, 
and philosophy are a few such methods that 
occasionally intersect, as they do in two elo- 
quent books that aptly complement each 
other: physicist Frank Wilczek’s A Beauti- 
ful Question and philosopher Alva Noe’s 
Strange Tools. 

A vivid, at times dense, journey through 
physics from Pythagorean geometry and 
Newton’s laws of dynamics to Einstein’s rela- 
tivity theory, quantum particles, and Wilc- 
zek’s own discovery of asjmiptotic freedom, 
A Beautiful Question seeks to determine 
whether the world embodies beautiful ideas. 
The question itself is not new, but Wilczek 
tackles it with remarkable audacity, bringing 
vast, compelling evidence for symmetry, pro- 
portion, and economy in nature’s own design 
as well as in science’s endeavors to expose it. 

Wilczek’s search for beauty in nature be- 
gins with a discussion of the natural philos- 
ophers of ancient Greece, who pursued the 
belief that matter is built from products of 
the mind and argued that beautiful forms 
reflect mathematical order. Indeed, some of 
the simplest creatures in our biosphere mir- 
ror the basic solid shapes described by Plato. 
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For instance, HIV and the herpes virus are 
perfect icosahedra. Radiolarians, ocean 
life-forms of ancient origins, equally in- 
carnate those symmetrical structures. Wil- 
czek further relishes the ineffable beauty 
in Maxwell’s electromagnetism equations, 
which he visualizes as a balanced “dance of 
concepts through space and time,” a move- 
ment he says is more easily experienced 
than explained. 

Wilczek’s enthusiasm in the book is pal- 
pable and contagious, in resonance with 
some of the concluding lines from his No- 
bel Prize lecture: “It is truly awesome to 
discover ... that we humans can come to 
comprehend Nature’s deepest principles, 
even when they are hidden in remote and 
alien realms.” If you ask Wilczek, a physi- 
cist’s work is surely driven by logical con- 
sistency. But that is not enough: Beauty is 
also needed to guide us toward truth. 

With incisive arguments and in crisp 
and engaging prose. Strange Tools brings 
the discourse on the function of art and 
beauty to a different level. Central to Noe’s 
book is the idea that, like philosophy, art 
is a tool to understand how we live. It is a 
practice— a technique— that, in his words, 
organizes and reorganizes us. 

Although it involves fundamental bio- 
logical qualities such as vision and other 
forms of perception, Noe argues that art 
is much more than biology. The act of see- 
ing is a basic or, as Noe calls it, first-order 
activity. On the contrary, art is a second- 
order activity. Although it cannot be dis- 
entangled from a biological substrate, it is 
enriched with meanings that emerge from 
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an identifiable cultural background. 

Contemplating a work of art can be a 
physical experience as well as an intellec- 
tual one. For example, visiting a sculpture 
by artist Richard Serra, who is famous for 
erecting labyrinth-like structures of steel 
and massive size, can be a disorienting ex- 
perience. It is difficult to distinguish front, 
back, or side and impossible to tell where 
the piece starts and ends. The disturbance 
is visceral. Noe argues that stepping into 
such spaces represents a “certain kind of 
enlightenment” that is akin to entering an 
entirely new intellectual world. 

Noe’s examples extend beyond the vi- 
sual world, arguing that music, too, elicits 
reactions in light of educated knowledge. 
That is why, for instance, the complexity 
of Beethoven may leave the musically un- 
versed indifferent— or why we often tend 
to have a privileged connection to the mu- 
sic of our own time. 

Against popular trends, Noe sides decid- 
edly against using neuroscience to eluci- 
date art— a position he lucidly upholds in 
a dedicated chapter. Neuroscience, he ar- 
gues, is not to be taken as an “intellectual 
readymade” that is pertinent to all phe- 
nomena— and especially not to art. Art, as 
he emphasizes several times in the book, 
is not a phenomenon like digestion or eye- 
sight. It is itself a mode of investigation. 

Noe’s book begins with a revealing epi- 
sode that ties the overall argument in his 
book with the leitmotiv of Wilczek’s inves- 
tigation. Asked by an artist about the sci- 
ence of visual perception, Noe responded 
that a vision scientist wonders how we get 
to see endless beauty in the world when it 
reaches us as warped upside-down images 
projected on our retinas. In other words, 
what a scientist asks is How do we see so ^ 
much based on so little? For the artist— § 

O 

who we later find out is Noe’s dad— the ^ 
right question to ask is exactly the oppo- ^ 
site: Why are we so blind, when there is so g 
much to see? § 

d 
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A new chemical economy 
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By Megan R. Schwarzman 

T he United States has a big problem 
when it comes to our production 
and consumption of chemicals— 
and we’ve been thinking about it 
all wrong. So argues Ken Geiser 
in his sweeping new tome, Chemi- 
cals Without Harm. Instead of decrying 
(and attempting to regulate) the health 
and environmental impacts of hazardous 
chemicals, Geiser argues that we need to 
see hazardous chemicals as the symptom 
of a much larger problem: a whole chemi- 
cal economy gone awry In addressing this 
new version of the problem, he issues a 
far-reaching scientific, technical, and cul- 
tural challenge: retool the material basis of 
our society toward a system that produces 
and uses dramatically safer chemicals. 

What starts as a thorough survey of 
key US. statutes governing the landscape 
of chemicals and products becomes a 
thoughtful investigation of how those laws 
have performed (mostly poorly) in the 
face of political maneuvering, procedural 
limitations, and small budgets. In search 
of solutions, Geiser places the chemicals 
problem within its technical context, ex- 
amining the complex system of feedstocks, 
processing steps, and product-manufactur- 
ing supply chains. He further implicates 
the current political climate, which he sees 
as hostile toward business regulation, as 
well as America’s cultural tradition of per- 
missiveness toward new technologies. 

Although jaded about the potential for 
the federal government to play a central role 
in the transformation to an economy based 
on safer chemicals (“The government that 
I studied and engaged with for some forty 
years has changed dramatically. There are 
no big visions and grand gestures now”), 
Geiser sees tremendous potential in a rich 
tapestry of other actors: nongovernmental 
organizations that aim to shift market forces, 
^ trade unions trying to improve workplaces, 
^ consumer goods manufacturers working 
^ on safer products, and state legislatures 
I adopting new chemical policies. In fact, op- 
5 timism and possibility run so continuously 
s through the book that if it were a song, 
^ its refrain might be “We can do it.” Geiser 

o 
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clearly believes that although a monumental 
transformation in the chemical economy is 
ambitious and runs counter to powerful and 
entrenched interests, a well-meaning society 
can and should make it a reality. 

Citing examples primarily from Europe, 
but also Asia, Geiser describes new, more 
comprehensive chemical policies that offer 
solid models, supported by chemical data, 
on which we might base our own programs. 
He argues that such policies are becoming 
de facto global standards. 

Geiser gathers examples of best prac- 
tices from around the world and in so do- 



ing has created a terrific resource. Want 
to catalog every chemical in commerce? 
Look to the European chemicals agency, 
where companies have registered more 
than 100,000 substances. Think we need 
a consumer-facing system for rating the 
chemical hazards in products? He points 
to GoodGuide, a free iPhone application 
that offers environmental ratings of over 
200,000 food items, toys, and household 
products. Geiser offers a vision of how 
each of these pieces might fit together to 
create a new, better functioning chemical 
economy. 

Although Geiser’s “yes, and” approach is 
welcome in the face of a challenge as ap- 
parently intractable as the current chemi- 
cal economy, some of his suggestions seem 



improbable. His substitute for government 
action— the blossoming of a million flow- 
ers, or in this case, crowd-sourced chemi- 
cal policy— is not always convincing. For 
example, he suggests that chemical manu- 
facturers could classify the hazard level of 
their own chemicals and that if the infor- 
mation were posted on the Internet, “its 
veracity could be managed by comments 
and critiques from the hundreds of con- 
tributors it most certainly would attract.” 
He notes that companies that make and 
distribute chemicals will need to reenvi- 



sion themselves as “valued agentfs] of 
environmental and health protection.” 
Particularly in light of the many stories 
Geiser tells of industry’s role in blocking 
much-needed change to existing industrial 
systems, the reader could be forgiven for 
mumbling, “Fat chance.” 

In the expanse of the book, though, we 
can tolerate some overly aspirational ideas. 
Geiser has gathered a wealth of informa- 
tion and applies it to the problem at hand 
in an imaginative way: offering a frame- 
work for how to reorient a hazardous and 
resource-intensive industry toward safety 
and sustainability. It is an impressive con- 
tribution— and a call to action. 

10.1126/science.aac9931 
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A technician collects water samples near the site of a chemical warehouse explosion in Tianjin, China, in August. 
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INSIGHTS 



Edited by Jennifer Sills 

Commercial 
forests: Native 
advantage 

THE REVIEW “PLANTED forest 
health: The need for a global 
strategy” (M. J. Wingfield et at., 

21 August, p. 832) suggests that 
preventing future forest pest 
problems requires integrating 
management approaches glob- 
ally. We agree, but with caveats. 

First, the dominant species 
used in plantation forestry (such as Pinus, 
Acacia, and Eucalyptus) rank among some 
of the worst biological invaders worldwide 
(i, 2). Efforts to improve plant performance 
by reducing pest and pathogen impacts will 
likely make those species more problematic 
and must therefore go hand-in-hand with 
approaches to mitigate their spread beyond 
plantations. The use of genetic modification 
to improve pest and pathogen resistance 
should be developed alongside techniques 
to ensure seed sterility for these species. 
Second, commercial forestry often requires 
the co-introduction of mutualistic organisms 
such as root-associated fungi and bacteria, 
which form symbiotic relationships with 
surrounding plants. These mutualists can 
themselves become invasive or interact 
with additional invasive species creating 
novel complex interactions {3, 4). Third, an 
increasing number of native insects and 
pathogens have shifted their host prefer- 
ence onto introduced plantation tree species 
(5) and attempts to manage these through 
biological control will face considerable 
regulatory hurdles. Fourth, the expansion 
of planted forests globally will increase the 
likelihood that new pests and pathogens will 
emerge in regions where commercial trees 
have been introduced and will pose new and 
uncertain risks to these trees in their native 
range. For example, myrtle rust is a signifi- 
cant pathogen of Eucalyptus plantations 
in Brazil that has now spread to Australia, 
potentially threatening a wide range of 
threatened native eucalypts (d). 

We suggest that rather than commercial 
forestry rel 3 dng on a relatively small set 
of plantation species and genotypes, more 
effort should be invested in developing 
native species as commercial alternatives to 
Pinus, Acacia, and Eucalyptus. Not only will 
this lessen the threat of biological invasion. 




Planted forests (left), invasion by forestry and horticultural tree species (center), 
and native forest (right) in New Zealand. 



but such planted forests will be established 
with the entire contingent of pests and their 
predators that might pose an ecological 
barrier to the invasion of alien pests and 
pathogens. Experiences with planted forests 
based on native species in Europe, Asia, 
and North America suggest that a strategy 
based on diverse native species adapted to 
local conditions may lead to a more robust 
long-term outlook for forestry, rather than 
a single global strategy reliant on a limited 
genetic diversity. 
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Bringing science to 
prisons is not enough 

C. J. LEROY describes her experience 
teaching science behind bars to improve 
cross-boundary science outreach (“Outside 
the tower: Bringing science inside prison 
walls,” Letters, 1 May, p. 511) and the 
importance of educating prisoners in envi- 
ronmental science (i). I established a science 
outreach course at the Champaign County 
Juvenile Detention Center (JDC) in Urbana, 
IL, that has since attracted several other 
graduate students who lead lecture discus- 
sions on neuroscience, evolutionary biology. 



and game theory, to name a 
few topics popular among the 
13- to 17-year-old students. We 
can all agree on the importance 
of bringing STEM to marginal- 
ized populations (2), but what 
good is a positive attitude or 
an interest in engaging science 
to marginalized populations 
of students who are system- 
atically excluded from higher 
education? The social stigma of 
imprisonment will follow these 
students, and the obstacles 
present before incarceration will 
not be removed after release, 
regardless of how engaged and 
thoughtful their comments 
are in class. The hunger for 
knowledge that LeRoy notes is 
inspiring, and I find hope in my 
students’ thoughtfulness, but no aspect of 
outreach can alone compensate for the lack 
of alternatives they experience on the out- 
side and will once again face upon release. 

It is difficult to commit to STEM enrich- 
ment without collaterally working to affect 
social change in the institutions that con- 
tribute to educational segregation and the 
policies that disproportionately cause harm. 
Until we broadly address these injustices, 
some students will be able to mature along a 
path to success paved by their communities, 
while others who have been shunted from 
school to prison, regardless of potential, will 
be out of luck. 

A collaboration of scientists and social 
activists built around the central issue of 
equality is needed to develop a system of 
inclusion that addresses the educational 
needs of all students, regardless of privilege 
or economic (dis)advantage. Collaterally, 
we should all continue to bring science to 
where it is needed most, and to learn what 
I quickly learned at the JDC: Concrete and 
steel separate my students from me, not the 
capacity to think like a scientist. 

Ian Traniello 

Carl R. Woese Institute for Genomic Biology, 
University of Illinois at Urbana-Champaign, 
Champaign, IL 61801, USA. E-mail: 
traniel2@illinois.edu 
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Illuminating next 
steps for NEON 

IN HIS 7 August In Depth News story “The 
dimming of NEON” (p. 574), J. Mends 
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reported cuts to the National Science 
Foundation’s (NSF’s) National Ecological 
Observatories Network (NEON) program. 
NEON is a highly ambitious research 
program that attempts to observe biological 
and ecological phenomena at unprec- 
edented scales (7). We served as science 
advisers, evaluating a series of budget and 
technical problems that compromised the 
envisioned capabilities of the Observatory. 
We recommended changes that will leave an 
Observatory of extraordinary value, provide 
continental coverage, and deliver transfor- 
mational science (2). 

Mervis focused on STREON, one of the 
most important conceptual components 
of NEON despite accounting for only a 
few percent of the budget. The STREON 
design adds an otherwise missing element 
to NEON by altering stream ecosystems, 
and NSF should identify a funding path to 
restore it. Deferment of STREON places 
crucial science at risk. 

Despite his generally upbeat appraisal 
of the Observatory, Mervis raised serious 
concerns of a broader nature, including 
“high staff turnover and conflicts created by 
giving scientists what they felt was insuf- 
ficient authority.” NSF’s model for project 
management emphasizes early science 
involvement in the design, but execution by 
project management professionals (3). This 
approach limited NEON’s ability to adapt 
scientifically as unforeseen technical and 
scientific challenges arose. 

Many of the challenges that derailed 
NEON were evident long ago, and some 
could have been addressed by peer-to-peer 
engagement between science and project 
management. Re-engaging the scientific 
community at this time will provide benefits 
for NSF, NEON, and the broader ecological 
community. A renewal of science leadership, 
and scientific peer review, are required to 
lead to final project success, allowing NEON 
to realize its benefits to science and society. 
Todd Dawson,^ Serita Frey,^ Eugene F, 
Kelly, ^ Susan Stafford,^ David SchimeF* 
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Missing methylation or cohesin 
leads to chromosome instability 

Tanno et al., p. 1237 
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Female crickets need 
only five neurons to 
recognize male songs 



NEUROSCIENCE 



How crickets 
communicate by sound 

S ound patterns present a computational 
challenge in all animals that commu- 
nicate by sound. Schdneich unraveled 
the processing of male calling songs 
by female crickets. A local network 
of only five neurons in the brains of the 
females forms an auditory feature detector 
circuit, with a response tuning that matches 
the selectivity of the female behavioral 
responses to the calls. — PLY 



Sc/. Adv. 10.1126/ sciadv.1500325 (2015). 



IN JOURNALS 



CRITICAL PHENOMENA 

Vortices in a 

superconducting egg crate 

Near equilibrium phase transi- 
tions, physical systems that bear 
no resemblance to one another 
can behave in a very similar 
way. For example, thermody- 
namic functions follow the same 
scaling behavior in a magnetic 




transition as in the seemingly 
unrelated gas-liquid transition. 
Does such universality exist in 
nonequilibrium phase transi- 
tions? Poccia etal. fabricated a 
square array of superconducting 
islands on a metallic surface. 
They applied a magnetic field, 
which caused vortices to form 
in between the islands, and 
induced a transition from a 
state in which vortices were 




Three-dimensional image of the dynamic resistance as a function of the driving 
current and the applied magnetic field 



stuck to their positions to one 
where they were able to move. 
They observed the same scaling 
behavior that applies to some 
equilibrium transitions. — JS 

Science, this issue p. 1202 



STRUCTURAL BIOLOGY 

Structure and function 
of the spliceosome 

When RNA is transcribed from 
DNA in the eukaryotic cell 
nucleus, the initial transcript 
includes noncoding introns 
that must be spliced out. This 
splicing is done by a complex 
macromolecular machine, the 
spliceosome, which comprises 
five small nuclear RNAs and 
more than 100 associated 
proteins. Now, two papers 
reveal insights into the struc- 
ture and function of the yeast 
spliceosome. Yan et al. describe 
a high-resolution structure 
determined by electron micros- 
copy of a spliceosome complex 
comprising four RNAs and 37 
proteins. Hang et al. focus on 



the catalytic site and show how 
protein components anchor the 
transcribed RNA and allow suf- 
ficient flexibility to deliver RNA 
components involved in catalyz- 
ing the splicing reaction. — VV 
Science, this issue pp. 1182 and 1191 



PLANT SCIENCE 

Transplanting the 
wisdom of the mayappie 

Etoposide, a topoisomerase 
inhibitor, is used to treat 
various cancers. However, 
etoposide isn’t that easy to get. 
Its precursor comes from the 
very slow-growing mayappie 
plant. Lau and Sattely used 
bioinformatics, heterologous 
enzyme expression, and kinetic 
characterization, to work out 
the pathway that makes the 
precursor in mayappie (see the 
Perspective by O’Connor). They 
then successfully transplanted 
the full biosynthetic pathway 
into tobacco plants. — PJH 

Science, this issue p. 1224; 

see also p. 1167 
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GLOBAL CARBON CYCLE 

Uptake uptick 

Has global warming slowed 
the uptake of atmospheric 
CO 2 by the Southern Ocean? 
Landschutzer et al. say no (see 
the Perspective by Fletcher). 
Previous work suggested that 
the strength of the Southern 
Ocean carbon sink fell during the 
1990s. This raised concerns that 
such a decline would exacerbate 
the rise of atmospheric OO 2 
and thereby increase global 
surface air temperatures and 
ocean acidity. The newer data 
show that the Southern Ocean 
carbon sink strengthened again 
over the past decade, which 
illustrates the dynamic nature of 
the process and alleviates some 
of the anxiety about its earlier 
weakening trend. — HJS 

Science, this issue p. 1221; 

see also p. 1165 

ELECTROCHEMISTRY 

Improving 
cobalt catalysts 

Tethering molecular catalysts 
together is a tried and trusted 
method for making them easier 
to purify and reuse. Lin etal. 
now show that the assembly 
of a covalent organic frame- 
work (OOF) structure can also 
improve fundamental catalytic 
performance. They used cobalt 
porphyrin complexes as building 
blocks for a COF. The resulting 
material showed greatly 
enhanced activity for the aque- 
ous electrochemical reduction of 
C02to CO.-JSY 

Science, this issue p. 1208 

NEURONAL IDENTITY 

Changing properties 
of interneurons 

Neuronal identity is determined 
early during development. It is 
assumed that once a neuron has 
adopted its identity, this remains 
stable throughout life. However, 
Dehorter et al. investigated the 
mechanisms controlling the 
specification of fast-spiking 
interneurons. They found that 
the transcription factor Er81 is 
crucial for the normal specifica- 
tion of these cells by regulating 

SCIENCE sciencemag.org 



the expression of a potassium 
channel subunit. — PRS 

Science, this issue p. 1216 



IN OTHER JOURNALS 



Edited hy Jesse Smith 
and Sacha Vignieri 



ANTIVIRAL IMMUNITY 

Viruses pack antiviral 
mediators 

Viruses often hijack host pro- 
teins for their own use, turning 
host cells into virion-spewing 
machines. However, Bridgeman 
etal. and Gentili etal. now report 
a sneaky way that the host can 
fight back (see the Perspective 
by Schoggins). Host cells that 
expressed the enzyme cGAS, 
an innate immune receptor 
that senses cytoplasmic DNA, 
packaged the cGAS-generated 
second messenger cGAMP 
into virions. Virions could then 
transfer cGAMP to neighbor- 
ing cells, triggering an antiviral 
gene program in these newly 
infected cells. Such transfer of 
an antiviral mediator may help to 
speed up the immune response 
to put the brakes on viral spread. 
-KLM 

Science, this issue pp. 1228 and 1232; 

see also p. 1166 



CANCER 

Shutting off 
cancer’s motor 

Glioblastoma is a difficult-to- 
treat common and aggressive 
brain tumor. Two mechanisms 
that contribute to its lethality are 
proliferation of the tumor cells 
and their invasion into normal 
brain tissue. Although these two 
processes are usually believed 
to be independent, Venere etal. 
show that a molecular motor, 
KIFll, plays a role in both. 
Inhibiting KIFll with a small 
molecule blocked the prolifera- 
tion and invasion of glioblastoma 
cells and prolonged survival in 
mouse models of the disease. 
These findings, together with 
the availability of a KIFll 
inhibitor that is safe for human 
use, suggest KIFll as a viable 
therapeutic target for treating 
glioblastoma. — YN 

Sci. Transl. Med. 7 , 304ral43 (2015). 




CLIMATE 

Explaining the iast giacier retreat 

B etween 19,000 and 11,500 years ago, glaciers around 
the globe retreated as Earth emerged from the last 
glacial period. Efforts to identify the main drivers of this 
retreat have been hampered by dating uncertainties. 
Shakun et al. have recalculated the ages of over 1000 
glacial boulders from almost 200 sites around the world, 
using an improved calibration that provides more accurate 
dates. They find that although there are regional variations, 
most glacier retreats coincided with the rise in atmospheric 
carbon during the deglaciation. The high sensitivity of alpine 
glaciers to carbon dioxide increases has implications for 
future glacier loss as a result of climate change. — JEU 

Nat. Commun. 10.1038/ncomms9059 (2015). 



11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 1179 



Published by AAAS 




RESEARCH \ IN OTHER JOURNALS 







Helicobacter pylori 
follow a urea trail to 
the relative safety of 
the gut epithelium 



MICROBIOLOGY 

Stomach microbe finds a safe haven 

H elicobacter pylori causes stomach ulcers and is linked 
to gastric cancer. How does this microbe survive in 
the destructive environment of the stomach? Huang 
etai. looked at how H. pylori manages to make its way 
to the gastric epithelium, where it can reside relatively 
unscathed by the stomach’s defenses. H. pylori express a pro- 
tein chemoreceptor, TIpB, that sniffs out urea released by the 
gut epithelium. The bugs follow a urea trail to the epithelium 
and simultaneously degrade the urea to generate ammonia 
and bicarbonate, which can help to buffer the microbe from 
the stomach acids. The authors watched how the bacteria 
locate and swim toward epithelia within seconds, attracted by 
minute amounts of urea. — SMH 



PROTEIN DESIGN 

Switching off protein 
production 

Controlling protein production is 
desirable, but current methods 
are complex, inefficient, difficult 
to generalize, or not quickly 
reversible. Chung et al. describe 
a small-molecule-assisted 
shutoff (SMASh) tag that is 



Cell Host Microbe 18, 147 (2015). 



genetically added to a target 
protein and allows reversible 
shutoff of various proteins in 
multiple cell types. The tag 
includes a site that is cut by a 
protease and a degron sequence 
that targets the protein for rapid 
destruction. Active protease 
cuts the tag from newly syn- 
thesized protein so that it does 
not disrupt protein function. 



However, inhibiting the protease 
with a clinically approved drug 
protects the tag, and the degron 
sequence causes the protein to 
be rapidly degraded. Stopping 
the drug restores protein pro- 
duction. — VV 

Nat Chem. Biol 10.1038/ 
nchembio.1869 (2015). 

MATERIALS SCIENCE 

Brushing up on being soft 
and squishy 

Polymer gels can be thought of 
as a netlike structure of cross- 
linked linear polymer chains 
swollen with solvent. Because 
these materials are mostly 
liquid, they can be quite soft and 
squishy, but this means that 
solvent loss, which makes the 
gels sticky, also is a concern. 

Cai etal. fabricated very long 
poly(dimethylsiloxane) chains 
with a bottle-brush structure: 
long backbone macromolecules 
heavily decorated with short 
side chains (a bit like a hairy cat- 
erpillar), which prevent the long 
chains from entangling. When 
crosslinked, these chains formed 
soft gellike materials with a 
tunable elastic modulus that 
was dependent on the crosslink 
density. — MSL 

Adv. Mater. 10.1002/ 

adma.201502771(2015). 

METABOLOMICS 

Terroir quantified 

Wines are characterized not 
only by year but by vineyard. 
Anesi etal. used chromatogra- 
phy and microarrays to analyze 
the metabolomes of one variety 
of the Corvina grape variety 
grown in different settings in 
Italy. The study spanned 3 years 
in order to average effects due 
to variation in temperature, 
sunlight, and rainfall. Shifts 
in the composition of volatile 
and nonvolatile compounds 
that correlate with altitude, 
viticultural practices, and char- 
acteristics of the soil revealed 
the signatures attributable to 
terroir. — PJH 

BMC Plant Biol. 10.1186/ 
sl2870-015-0584-4(2015). 



NEURODEVELOPMENT 

Brain development 
hamstrung 

Mutations in a zinc-finger 
transcription factor known as 
Zic2 cause problems in cortical 
development and can result 
in schizophrenia symptoms. 

Studying embryonic mice, 

Murillo et al. show that this 
protein supports migrations of 
several types of neural cells dur- 
ing brain development. The loss 
of Zic2 hobbled Cajal-Retzius 
cells, which lay the groundwork 
for further brain development, 
as well as cells destined to build 
key relay and integrative centers 
in the brain. Normal in number, 
the stranded Cajal-Retzius cells 
release their signaling molecule 
reelin in the wrong places. The 
migration problems may also 
range through neural develop- 
ment, affecting cells from the 
prechordal plate to the neural 
crest that emigrate from the 
neural tube. — PJH 

J.Neurosci. 35,32(2015). 

EDUCATION 

Assessing chemistry one 
ciass at a time 

Before changes in a curriculum 
can be proposed, it is neces- 
sary to assess the current state 
of the course. Fox and Roehrig 
did exactly that when surveying 
undergraduate physical chemis- 
try courses in the United States. 

Data were collected about issues 
such as depth versus breadth of 
content, types of assessments, 
teacher preparation, and beliefs 
about the challenging nature 
of physical chemistry, allow- 
ing more-informed decisions 
to be made about the future of 
physical chemistry education. 

Two main themes uncovered by 
the survey, “how can instructors 
clearly translate learning goals 
into assessments” and “how can 
instructors gain more teacher 
preparation experience,” are o 

relevant to all science disciplines § 

and can be used as a starting | 

point for future science educa- | 

tion research. — MM ^ 

J. Chem. Edu. 10.1021/ § 

acs.jchemed. 5500070 (2015). S 

sciencemag.org SCIENCE 
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HUMAN GENOMICS 

Duplications and deietions 
in the human genome 

Duplications and deletions can 
lead to variation in copy number 
for genes and genomic loci 
among humans. Such variants 
can reveal evolutionary patterns 
and have implications for human 
health. Sudmant etal. examined 
copy-number variation across 
236 individual genomes from 
125 human populations. 
Deletions were under more 
selection, whereas duplications 
showed more population- 
specific structure. Interestingly, 
Oceanic populations retain 
large duplications postulated 
to have originated in an ancient 
Denisovan lineage. — LMZ 

Science, this issue p. 1181 

MESOSCOPIC PHYSICS 

Making and manipulating 
aweak-iinkqubit 

In superconductors, single 
particles cannot have energies 
smaller than the supercon- 
ducting gap. Yet when two 
superconductors are separated 
by a thin nonsuperconducting 
bridge (the “weak link”), quasi- 
particles can occupy states that 
are inside the gap, the so-called 
Andreev bound states (ABSs). 
Janvier et al. fabricated such a 
structure out of superconduct- 
ing aluminum and manipulated 
the occupation of a pair of ABSs. 
They observed oscillations in 
population between two of the 
energy levels, forming a type of 
qubit, which they dubbed the 
Andreev qubit. The results may 



lead to applications in quantum 
information processing. — JS 

Science, this issue p. 1199 

GEOPHYSICS 

Faults well oriented for 
failure 

A deck of cards will remain 
motionless while pressed on 
from above, but easily sepa- 
rates when sheared. Similarly, 
the ease by which geological 
faults rupture depends on the 
geometry of the fault relative to 
the size and direction of stress. 
Hardebeck finds that faults 
are well oriented for failure in 
subduction zones worldwide, 
suggesting a low-stress envi- 
ronment (see the Perspective 
by Burgmann). Subduction 
zone faults unleash powerful 
earthquakes. These estimates 
on the state of stress constrain 
potential generation mecha- 
nisms of destructive subduction 
zone earthquakes. — BG 

Science, this issue p. 1213; 

see also p. 1162 

CHROMOSOMES 

A misstep in the 
chromosomes’ dance 

The chromosomes of most 
cancer cells are unstable, suffer- 
ing breaks and rearrangements. 
These aberrations perturb nor- 
mal gene-regulatory programs 
and contribute to cancer progres- 
sion. Tanno et al. studied several 
human cell lines that show chro- 
mosome instability. Homologous 
chromosomes did not pair very 
well with each other during cell 



division. These chromosomes 
had lost a specific chromatin 
covalent modification, methyla- 
tion of histone H3 lysine 9, and/ 
or the protein “glue” cohesin at 
the chromosomes’ centromeres. 
Either of these losses interfered 
with the binding of the shugoshin- 
like protein 2, which is required 
for proper chromosome pairing 
and segregation. — GR 

Science, this issue p. 1237 



CANCER 

RNF43 halts Wnt at the 
nucleus 

Wnt signaling, which triggers 
the transcriptional activity of 
(3-catenin, is critical during devel- 
opment and is often reactivated 
in cancer. Loregger etal. found 
that RNF43 inhibited Wnt-(3- 
catenin signaling by sequestering 
TGF4, a p-catenin partner and 
transcription factor, at the nuclear 
membrane. Wnt signaling was 
increased in frog embryos that 
expressed forms of RNF43 with 
mutations such as those found in 
human gastrointestinal tumors. 
Furthermore, coexpression of 
wild-type RNF43 suppressed 
Wnt signaling even in cells with 
a constitutively active mutant of 
p-catenin. — LKF 

Sci. Signal. 8 , ra90 (2015). 



EVOLUTION 

Symbioses between 
single-celled organisms 

Examples abound of long-term 
mutually beneficial relationships 
between microorganisms and 
plants or animals. In contrast. 



only a few examples of such 
symbioses between unicel- 
lular partners are known. In a 
Perspective, Zehr highlights 
recent reports of unicellular 
symbioses in marine and fresh- 
water systems. In both cases, 
the cyanobacterial partner in the 
symbiosis has a much reduced 
genome and must thus always 
carry their symbiont. These 
systems resemble organelle 
evolution and may help to under- 
stand the evolutionary events 
that led to organelle formation in 
eukaryotic cells. — JFU 

Science, this issue p. 1163 



QUANTUM MECHANICS 

Interfering with time 

The interference pattern arising 
from light or particles passing 
through a double slit is a simple 
experiment that belies the 
subtleties of interpretation 
when attempting to describe 
and understand the effect. For 
example, determining “which 
path” the light or particles travel 
can result in the interference 
pattern disappearing. Margalit 
et al. present a new take on 
interferometry using time 
(see the Perspective by Arndt 
and Brand). A clock— i.e., the 
internal state of a cold atom 
condensate— was coherently 
split and brought back together 
to interfere. Making one-half 
of the clock tick at a different 
rate resulted in a change in the 
interference pattern, possibly as 
a consequence of the time being 
a “which path” witness. — ISO 

Science, this issue p. 1205; 

see also p. 1168 
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Damian Labuda, Mait Metspalu, Sarah A. Tishkofif, Stanislav Dryomov, Rem Sukemik, 

Nick Patterson, David Reich, Evan E. Eichler* 



INTRODUCTION: Most Studies of human ge- 
netic variation have focused on single-nucleotide 
variants (SNVs). However, copy-number var- 
iants (CNVs) affect more base pairs of DNA 
among humans, and yet our understanding 
of CNV diversity among human populations is 
limited. 

RATIONALE: We aimed to understand the 
pattern, selection, and diversity of copy-number 
variation by analyzing deeply sequenced ge- 
nomes representing the diversity of all humans. 
We compared the selective constraints of de- 
letions versus duplications to understand pop- 
ulation stratification in the context of the 



ancestral human genome and to assess dif- 
ferences in CNV load between African and 
non-African populations. 

RESULTS: We sequenced 236 individual ge- 
nomes from 125 distinct human populations and 
identified 14,467 autosomal CNVs and 545 
X-linked CNVs with a sequence read-depth 
approach. Deletions exhibit stronger selective 
pressure and are better phylogenetic markers 
of population relationships than duplication 
polymorphisms. We identified 1036 population- 
stratified copy-number-variable regions, 295 
of which intersect coding regions and 199 of 
which exhibit extreme signatures of differ- 



entiation. Duplicated loci were 1.8-fold more 
likely to be stratified than deletions but were 
poorly correlated with fianking genetic diver- 
sity. Among these, we highlight a duplication 
polymorphism restricted to modern Oceanic 
populations yet also present in the genome of 
the archaic Denisova hominin. This 225-kilo- 
base pair O^bp) duplication includes two micro- 
RNA genes and is almost fixed among human 
Papuan-Bougainville genomes. 

The data allowed us to reconstruct the an- 
cestral human genome and create a more ac- 
curate evolutionary framework for the gain 
and loss of sequences dur- 
ing human evolution. We 
identified 571 loci that 
segregate in the human 
population and another 
2026 loci of fixed-copy 2 
in all human genomes but 
absent from the reference genome. The to- 
tal deletion and duplication load between 
African and non-African population groups 
showed no difference after we account for 
ancestral sequences missing from the human 
reference. However, we did observe that the 
relative number of base pairs affected by 
CNVs compared to single-nucleotide poly- 
morphisms is higher among non-Africans 
than Africans. 



ON OUR WEB SITE 



Read the full article 
at http://dx.doi. 
org/10.1126/ 
science. aab3761 



CONCLUSION: Deletions, duplications, and 
CNVs have shaped, to different extents, the 
genetic diversity of human populations by 
the combined forces of mutation, selection, 
and demography. ■ 



The list of author affiliations is available in the full article online. 

*Corresponding author. E-mail: eee@gs.washington.edu 
Cite this article as P. H. Sudmant et al.. Science 349, 
aab3761 (2015). DOI: 10.1126/science.aab3761 




Global human CNV diversity and archaic introgression of a chromosome 16 duplication. (Left) The geographic coordinates of populations 
sampled are indicated on a world map (colored dots). The pie charts show the continental population allele frequency of a single ~225-kbp duplication 
polymorphism found exclusively among Oceanic populations and an archaic Denisova. (Right) The ancestral structure of this duplication locus (1) and the 
Denisova duplication structure (2) are shown in relation to their position on chromosome 16. We estimate that the duplication emerged -440 thousand 
years ago (ka) in the Denisova and then introgressed into ancestral Papuan populations -40 ka. 
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In order to explore the diversity and selective signatures of duplication and deletion human 
copy-number variants (CNVs), we sequenced 236 individuals from 125 distinct human 
populations. We observed that duplications exhibit fundamentally different population 
genetic and selective signatures than deletions and are more likely to be stratified between 
human populations. Through reconstruction of the ancestral human genome, we identify 
megabases of DNA lost in different human lineages and pinpoint large duplications 
that introgressed from the extinct Denisova lineage now found at high frequency 
exclusively in Oceanic populations. We find that the proportion of CNV base pairs to 
single-nucleotide-variant base pairs is greater among non-Africans than it is among African 
populations, but we conclude that this difference is likely due to unique aspects of 
non-African population history as opposed to differences in CNV load. 



I n the past decade, genome sequencing has 
provided insights into demography and mi- 
gration patterns of human populations {l-4\ 
ancient DNA (5-7), de novo mutation rates 
(8-10), and the relative deleteriousness and 
frequency of coding mutations (11, 12). Global 
human diversity, however, has only been par- 
tially sampled, and the genetic architecture of 
many populations remains uncharacterized. To 
date, the majority of human diversity studies 
have focused on single-nucleotide variants (SNVs), 
although copy-number variants (CNVs) have 
contributed significantly to hominid evolution 
(13, 14), adaptation, and disease (15-18). Much 
of the research into CNV diversity has been 
performed with single-nucleotide polymorphism 
(SNP) microarray and array comparative ge- 
nomic hybridization (aCGH) platforms (19-22), 
which provide limited resolution. In addition, 
comparisons of population CNV diversity with 
heterogeneous discovery platforms may lead to 
spurious population-specific trends in CNV di- 
versity (22, 23). Although there are many other 
forms of structural variation (e.g., inversions or 
mobile element insertions), in this study we 
focused on understanding the population genet- 
ics and normal pattern of copy-number var- 



iation by deep sequencing a diverse panel of 
human genomes. 

Results 
CNV discovery 

We sequenced to high coverage a panel of 236 
human genomes representing 125 diverse human 
populations from across the globe (Fig. 1 and 
table S2). Sequencing was performed to a mean 
genome coverage of 41-fold from libraries pre- 
pared by using a standard polymerase chain 
reaction-free protocol on the HiSeq 2000 II- 
lumina (San Diego, CA) sequencing platform 
(24). The panel includes representation from a 
broad swathe of human diversity, including in- 
dividuals from across Siberia, the Indian sub- 
continent, and Oceania. We also analyzed the 
high-coverage archaic Neanderthal (25) and 
Denisova (26) as well as three ancient human 
genomes to refine the evolutionary origin and 
timing of CNV differences (24). We applied a 
read-depth-based digital comparative genomic 
hybridization (dCGH) approach (13, 24) to iden- 
tify 14,467 autosomal CNVs and 545 X-linked 
CNVs among individuals relative to the reference 
genome (Table 1 and table SI), which we esti- 
mate provides breakpoint resolution to -210 base 



pairs (bp) (24). CNV calls were validated with SNP 
microarrays and a custom aCGH microarray that 
targeted all CNVs identified in 20 randomly se- 
lected individuals (24). 

The median CNV size was 7396 bp, with 82.2% 
of events (n = 12,338) less than 25 kbp (24). CNVs 
mapping to segmental duplications were larger 
on average (median of 14.4 kbp) than CNVs map- 
ping to the unique portions of the genome (med- 
ian of 6.2 kbp). Almost one-half of CNV base pairs 
mapped within previously annotated segmental 
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Fig. 1. Analysis of CNVs in several world populations. The geographical locations of the 125 human populations, including two archaic genomes, 
assessed in this study. Populations are colored by their continental population groups, and archaic individuals are indicated in black. 



Table 1. CNVs and SNVs broken down by their intersection with genomic region. The number 
of mega-base pairs of exonic and segmentally duplicated CNVs reflects the amount of exonic and 
segmental duplication sequences affected, respectively, not the total sum of the intersecting CNVs. 



Class 


Autosomal (Mbp) 


X chromosome (Mbp) 


Exonic (Mbp) 


Segmentally 
duplicated (Mbp) 


Deletions 


7,233 (78.99) 


278 (6.61) 


636 (0.32) 


331 (8.47) 


Duplications 


7,234 (129.62) 


267 (6.46) 


2,093 (1.56) 


4,462 (96.93) 


Subtotal 


14,467 (204.54) 


545 (12.61) 


2,729 (1.84) 


4,793 (99.84) 


SNVs 


32,630,650 (32.63) 


1,175,170 (1.18) 


314,872 (0.31) 


1,559,158 (1.56) 


All 


32,645,117 (237.17) 


1,175,715 (13.79) 


317601 (2.15) 


1,563,951 (101.4) 



duplications (a 10-fold enrichment) (Table 1). In 
total, 217.1 Mbp (7.01%) of the human genome 
are variable because of CNVs, in contrast to 
33.8 Mbp (1.1%) resulting from single-nucleotide 
variations (Table 1). Deletions (loss of sequence) 
were less common (representing 85.6 Mbp or 
2.77% of the genome) compared with duplications 
(gain of sequence, 136.1 Mbp or 4.4% of the ge- 
nome). Furthermore, comparing our data set with 
other studies of CNVs (21, 27), 67 to 73% of calls we 



report are unique to our study, whereas we cap- 
tured 68 to 77% of previously identified CNVs {24). 

CNV diversity and seiection 

African populations are broadly distinguished 
from non-African populations by a principal 
component analysis (PCA) for either deletions 
(Fig. 2A and fig. S20) {24) or duplications (Fig. 
2B). In this analysis, we limited the variants to 
biallelic deletions or biallelic duplications (dip- 



loid genotypes of two, three, or four) to eliminate 
difficulty of inferring phase from multicopy CNVs. 
For deletions, PCI (6.8% of the variance) and PC2 
(3.94%) distinguish Africans, West Eurasians, 
East Asians, and Oceanic populations. PC3 and 
PC4, describing 2.8% and 2.0% of the total var- 
iance, cluster Papuans and populations of the 
Americas, respectively. Many other populations 
were predictably distributed along clines between 
these clusters (e.g.. Northern Africans, Siberians, 
South Asians, Amerindians, and indigenous peo- 
ples of Philippines and North Borneo). PCAs gen- 
erated from SNVs showed patterns similar to 
those from deletions. Africans also show much 
greater heterozygosity (Fig. 2C and Table 2), for 
instance, -25% more heterozygous biallelic dele- 
tions and more than a twofold difference when 
compared with Amerindians (0AMcan = 535 versus 
0Americas = 209). The archaic Neanderthal and 
Denisova genomes form an out-group to all 
humans {24). 

Duplication heterozygosity and PCA in general 
show similar trends (Fig. 2D), albeit with far less 
definition. Oceanic populations, especially those 
from Papua New Guinea, Australia, and Bou- 
gainville, showed the greatest separation on PCI 
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Fig. 2. Population structure and CNV diversity. PCA of individuals assessed in this study plotted for biallelic deletions (A) and duplications (B) with colors and 
shapes representing continental and specific populations, respectively. Individuals are projected along the PCI and PC2 axes. The deletion (C) and duplication 
(D) heterozygosity plotted and grouped by continental population. The relationship between SNV heterozygosity and deletion (E) or duplication (F) heterozygosity 
is compared. 



by duplication. Biallelic duplications appear to 
be somewhat less-informative markers of human 
ancestry, in contrast to SNVs, which provide the 
greatest resolution (e.g., SNV PCs 1 to 4 describe 
5.8, 3.4, 2.6, and 1.7% of the variance, respec- 
tively). This difference is also seen when compar- 
ing SNV and CNV heterozygosity (Fig. 2, E and 
F). Whereas heterozygous biallelic deletions were 
strongly correlated (R = 0.88) with SNV het- 
erozygosity, the correlation between SNVs and 
duplications was much weaker (R = 0.27). We 
compared this correlation for duplications lo- 



cated adjacent to segmental duplications (within 
or proximal 150 kbp) in contrast to those oc- 
curring in unique regions of the genome and 
therefore less likely to be subject to recurrent 
mutation. Heterozygous duplications occurring 
in unique regions were better correlated with 
heterozygous SNVs (r = 0.29) than those adjacent 
or within segmental duplications (r = 0.17), al- 
though the difference was not significant (two- 
sided Williams’ test P < 0.1). 

Studies of larger (>100 kbp) deletion and du- 
plication events indicate that deletions are more 



deleterious than duplications (28). We reasoned 
that this may be reflected in the allele frequency 
spectrum (AFS) of normal genetic variation and 
compared the AFS of genic versus intergenic de- 
letions and duplications for smaller events (Fig. 
3, A and B). Genic deletions were significantly 
rarer than intergenic deletions (Wilcoxon rank 
sum test, P = 1.84 x 10“^), but genic duplications 
showed no such skew (Wilcoxon rank sum test, 
P = 0.181). Size also had a significant impact on 
the AFS of CNVs. Deletions increased in rarity 
as a function of size (F test, P = 5.02 x 10“^^) 
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Table 2. Summary statistics of biallelic CNV deletions versus SNVs by continental population group. 



Continental population 
group 


n 


Segregating SNVs 


CNVs/ 

Segregating CNVs individual 
(median) 


Heterozygous 

CNVs/individual 

(median) 


Continental population 
group-specific CNVs 
(allele count > 2) 


0cNv/genome 


West Eurasian (WEA) 


58 


13610715 


1728 


279.0 


209.0 


688 (89) 


324.42 


Oceanic (OCN) 


21 


9467426 


1022 


263.0 


173.0 


353 (84) 


237.51 


East Asian (EA) 


45 


17452049 


1463 


271.0 


191.0 


525 (59) 


288.48 


Siberian (SIB) 


23 


9644914 


1102 


285.0 


205.0 


214 (30) 


250.74 


South Asian (SA) 


27 


11308883 


1405 


279.0 


208.0 


418 (43) 


308.32 


Americas (AMR) 


21 


8127639 


899 


266.0 


169.0 


208 (25) 


208.93 


African (APR) 


41 


21698517 


2663 


319.0 


261.0 


1772 (702) 


534.97 



A 



deletions 



duplications 




allele frequency 

C 




allele frequency 

D 




Fig. 3. Selection on CNVs. Folded allele-frequency spectra of exon-intersecting deletions (A) and dup- 
lications (B). Whereas deletions intersecting exons are significantly rarer than intergenic deletions, exon- 
intersecting duplications show no difference compared to intergenic duplications. The mean frequency of 
CNVs beyond a minimum size threshold is plotted for deletions (C) and duplications (D). A strong negative 
correlation between size and allele frequency is observed for deletions but less so for duplications. 



(Fig. 3C), but only a nominally significant trend 
was observed for duplications (P = 0.031) (Fig. 
3D). These data suggest that selection has shaped 
the extant diversity of deletions and duplications 
differently during human evolution. 

Population stratification 

Because population stratification can be indic- 
ative of loci under adaptive selection, we cal- 
culated Vst statistics for each CNV among all 
pairs of continental population groups, a metric 
analogous to Pgt (the fixation index) (29). Vgt and 
Pst statistics compare the variance in allele fre- 
quencies between populations, with Fgt allowing 
comparison of multiallelic or multicopy CNVs. We 



identified 1036 stratified copy-number-variable 
regions (CNVRs with maximum population Fgt > 
0.2, -10% of the total), 295 of which intersected the 
exons of genes and 199 that exhibited extreme 
stratification (Fst > 0-^) (table S3). After correct- 
ing for copy number, duplicated loci were 1.8-fold 
more likely to be stratified than deletions. This 
finding is more remarkable in light of the fact 
that duplications were less discriminatory by 
PCA, suggesting that a subset of multiallelic 
duplicated CNVs show large allele frequency 
differences between different populations (see 
discussion below). The Fst of stratified dupli- 
cated CNVs was weakly correlated with the 
of flanking SNVs (R^ = 0.03, P = 3.27 x 10“^^) 



in contrast to deletions (R^ = 0.2, P <2 x 10“^^). 
Stratified duplication loci, thus, are far less like- 
ly to be tagged by adjacent SNPs through link- 
age disequilibrium. 

Many of the population-differentiated loci were 
multiallelic and mapped to segmental duplica- 
tions, including the repeat domain of ANKRD36 
and the DUF1220 domain of NBPF (24) (Table 3). 
Several of these population differences involve 
genes of medical consequence, such as the mul- 
tiallelic duplication of CLPS, a pancreatic colipase 
involved in dietary metabolism of long-chain 
triglyceride fatty acids (Fig. 4A). Increased ex- 
pression in mouse models of this gene is nega- 
tively correlated with blood glucose levels (30). 
A duplication of the haptoglobin and haptoglobin- 
related (HP and HPR) genes expanded exclu- 
sively in Africa. The duplication has recently been 
associated with a possible protective effect against 
trypanosomiasis in Africa, although only copy 
3 and 4 alleles were reported (31). We find this 
locus has further expanded to five and six copies 
in Esan, Gambian, Igbo, Mandenka, and Yoruban 
individuals (Fig. 4A). We also compared the loca- 
tion of our CNVs with disease loci identified by 
genome-wide association study (GWAS) (32) and 
sites of potential positive selection (33). Although 
only a small fraction of our CNVs (1 to 6%) over- 
lapped such functional annotation, we note that 
21% of putative adaptive loci intersected with a 
CNV when compared with 6% of disease GWAS 
loci (table S4). Because many of the intervals 
are large, further refinement and investigation 
are needed to determine the importance of such 
overlaps. 

Denisovan CNVs are retained and 
expanded in Oceanic populations 

We fiuther searched for highly stratified population- 
specific CNVs sharing alleles with the archaic 
Neanderthal and Denisovan individuals assessed 
in our study. Although no Neanderthal-shared 
population-specific CNVs were identified, five 
Oceanic-specific CNVs were identified that shared 
the Denisova allele at high frequency (24). Papuan 
genomes have previously been reported to harbor 
3 to 6% Denisovan admixture (6, 26). CNVs of 
putative Denisovan ancestry were at remark- 
ably high frequency in Papuan individuals (all 
>0.2 allele frequency), with one -9-kbp deletion 
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Table 3. CNVs differentiated between human populations. CNVs intersecting genes that show dramatic difference in copy number (as measured by Vst) 
between human populations (see Fig. 1 for definition of populations). 



Locus 


Genes 




Copy range 


Description 


chr2:97849921-97899292 


ANKRD36 


0.49 (OCN-WEA) 


30-41 


Repeat domain expanded to 45 copies 
in Papuans. 










Expansion of the DUF1220 repeat domain 


chrl:144146792-144224420 


NBPF 


0.32 (AFR-EA) 


185-271 


in Africans and Amerindians. Copy number 
associated with cognitive function and 
autism severity (47). 










Pancreatic colipase involved in dietary 
metabolism of long-chain triglyceride 


chr6:35749042-35767153 


CLPS 


0.29 (AMR-SA) 


2-6 


fatty acids. Increased expression is 
negatively correlated with blood glucose 
in mice (30). 










Haptoglobin and haptoglobin-related genes 


chrl6:72088031-72119241 


HP, HPP 


0.25 (AFR-WEA) 


1-6 


are expanded exclusively in Africa and 
associated with a possible protective 
effect against trypanosomiasis (31). 










DPY genes are required for sperm head 
elongation and acrosome formation 
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during spermatogenesis, and DPY19L2 
homozygous deletions have been identified 
as a major cause of globozoospermia (48). 
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LRRIQ3 is duplicated exclusively in 
Siberian and Amerindian populations. 










Deletions of corticotropin-releasing hormone 


chrl7:43692284-43708692 


CRHRl 


0.25 (EA-WEA) 
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receptor 1 result in reduced anxiety and 
neurotransmission impairments in mice (49). 
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IRGM promoter 
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The IRGM promoter CNV is a Crohn’s disease 
risk factor (50). 


chr3:195771149-195776591 


TFRC promoter 


0.57 (AFR-EA) 


0-2 


Transferrin receptor is a cellular receptor for 
New World hemorrhagic fever arenaviruses (51). 



lying 2 kbp upstream of the long noncoding 
RNALINC00501, another 5-kbp duplication lying 
8 kbp upstream of the METTL9 methyltransferase 
gene, and a 73.5-kbp duplication intersecting the 
MIR548D2 send MIR548AA2 microRNAs (Fig. 4B). 

We determined that the latter two are part 
of a larger composite segmental duplication 
that appears to have almost fixed among human 
Papuan-Bougainville genomes [allele frequency 
(AF) = 0.84] but has not been observed in any 
other extant human population (Fig. 4, B and C). 
We noted three additional duplications proximal 
to this locus exhibiting strikingly correlated copy 
number, despite being separated by >1 Mbp in 
the reference genome (Fig. 4C) (24). We suggest 
that these constitute a single, larger (-225 kbp) 
complex duplication composed of different seg- 
mental duplications. By using discordantly map- 
ping paired-end reads, we resolved the organization 
of two duplication architectures not represented 
in the human reference (Fig. 4D). The first of which 
(architecture A/C) is present in all individuals as- 
sessed in this study (5625 discordant paired-end 
reads supporting) but not in the human reference 
genome. The second (B/D) corresponds to the 
Denisova-Papuan-specific duplication and is only 
present in these individuals and the Denisova ge- 
nome. Seventy paralogous sequence variants [mark- 



ers distinct to paralogous locus (34, 35)] distinguish 
the Papuan duplication, of which 65/70 (92.9%) 
were shared with the archaic Denisova genome. 
On the basis of single-nucleotide divergence, we 
estimate that the duplication emerged -440 thou- 
sand years ago (ka) and rose to high frequency in 
Papuan (>0.80 AF) but not Australian genomes, 
probably over the past 40,000 years after introgres- 
sion from Denisova (Fig. 4E). This polymorphism 
represents the largest introgressed archaic hom- 
inin duplication in modern humans. 

The ancestral human genome 

The breadth of the data set allowed us to recon- 
struct the structure and content of the ancestral 
human genome before human migration and 
subsequent gene loss. To identify ancestral se- 
quences potentially lost by deletion, we iden- 
tified a set of sequences present in chimpanzee 
and orangutan reference genomes but absent 
from the human reference genome (20,373 non- 
redundant loci corresponding to 40.7 Mbp of se- 
quence). Of these, 9666 (27.6 Mbp) were unique 
(i.e., not composed of common repeats). Because 
of the inability to accurately genotype copy num- 
ber for unique segments less than 500 bp by 
read-depth analysis, we limited our ancestral 
reconstruction to nonrepetitive sequences greater 



than this length threshold. Although the major- 
ity represented deletions specifically lost in the 
human lineage since divergence from great apes 
(6341 loci) or else referenced genome artifacts 
(2026 loci fixed-copy 2 in all individuals as- 
sessed, 6.2 Mbp), a small subset of these (n = 571 
or 1.55 Mbp) segregate as biallelic polymorphisms 
in human populations (Fig. 5A). As expected, 
Africans were more likely to show evidence of 
these ancestral sequences compared with non- 
African populations, because the latter have ex- 
perienced more population bottlenecks and thus 
retained less of the ancestral human diversity. A 
comparison to archaic genomes allowed us to 
identify sequences (50 loci or 104 kbp) that were 
present in Denisova or Neanderthal but lost in 
all contemporary humans as well as ancestral 
sequences present in all humans but not found 
in Denisova or Neanderthal (17 loci or 33.3 kbp). 

No difference in the CNV load between 
Africans and non-Africans 

The high coverage and uniformity allowed us to 
contrast putatively deleterious, exon-removing 
CNVs among human populations, which are of 
interest in disease studies (36-38). In our call set, 
we identified 2437 CNVRs intersecting exons. 
The distribution of allele counts of these tended 
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Fig. 4. Population-stratified CNVs and archaic introgression. (A) Four 
specific examples of population-stratified CNVs intersecting genes are shown, 
including LRRIQ3, the pancreatic collipase CLPS, the sperm head and ac- 
rosome formation gene DPY19L2, and the haptoglobin and haptoglobin- 
related genes HP and HPR. Dot plots indicate the copy of the locus in each 
individual, and pie charts with colors depict the continental population dis- 
tribution per copy number (see text for details and Figs. 1 and 2 and dot 
plots for color scheme). (B) Predicted copy number on the basis of read depth 
for a 73.5-kbp duplication on chromosome 16. It is observed in the archaic 
Denisovan genome and at 0.84 allele frequency in Papuan and Bougainville 
populations, yet absent from all other assessed populations. The duplication 
intersects two microRNAs.The orange arrow corresponds to the position and orientation of this duplication as further highlighted in (C) and (D). (C) A heat map 
representation of a ~1-Mbp region of chromosome 16pl2 (chrl6:21518638-22805719). Each row of the heat map represents the estimated copy number in 1-kbp 
windows of a single individual across this locus. Genes, annotated segmental duplications, arrows highlighting the size and orientation in the reference of the 
Denisova/Papuan-specific duplication locus (locus D), and three other duplicated loci (A, B, and C) of interest are shown below. (D) The structure of 
duplications A, B, C, and D [as shown in (C) over the same locus] in the reference genome and the discordant paired-end read placements used to characterize two 
duplication structures. Structure A/C is found in all individuals, although not present in the reference genome, whereas structure B/D is only found in Papuan and 
Bougainville individuals, indicating a large (-225 kbp), complex duplication composed of different segmental duplications. Both the A/C and B/D duplication 
architectures exhibit inverted orientations compared with the reference. The number of reads in all Oceanic and non-Oceanic individuals supporting each structure 
are indicated. (E) Maximum likelihood tree of the 16pl2 duplication locus [duplication D in (B) to (D)] constructed from the locus in orangutan, Denisova, the 
human reference, and the inferred sequence of the Papuan duplication (24). All bootstrap values are 100%. 
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Fig. 5. The ancestral human genome and CNV burden. (A) A heat map of 
the allele frequency of 571 (1.55 Mbp) nonrepetitive sequences absent from the 
human reference genome yet segregating in at least one population ordered in 
humans by a maximum likelihood tree (49). Four groups of interest are 
highlighted: Gl, ancestral sequences that have almost been completely lost 
from the human lineage: G2, ancestral sequences that are largely fixed but 
rarely deleted (also absent in human reference): G3, ancestral sequences that 
have become copy-number variable since the divergence of humans and 
Neanderthals/Denisovans -700 ka; and G4, sequences potentially lost in 
Neanderthals and Denisovans since their divergence from humans. (B) The 



resulting distributions of 10,000 block-bootstrapped estimates of the dif- 
ference in load between African (AFR) and non-African (nAFR) populations 
considering only the reference genome (GRGh37) and supplemented by se- 
quence absent from the human reference genome (GRGh37 + NHP) included 
(see text for details). (C) Violin plots of the distribution of the ratio of dele- 
tion base pairs to SNV base pairs differing between every pair of African 
individuals (AFR-AFR), all pairs of non-African individuals (nAFR-nAFR), and 
every non-African, African pair (nAFR-AFR). (D) Heat map representation of 
the mean ratio of deletion to SNV base pairs differing between individuals 
from pairs of populations. 



toward lower-frequency events with, again, dele- 
tions more rare than duplications (Wilcoxon rank 
sum test, P = 1.25 x 10"^). Collectively, individuals 
harbor a mean of 19.2 exon-intersecting deletions 
per genome (22.8 per diploid genome), with 
African individuals exhibiting, on average, a mean 
of 22.4 deletions compared with 18.6 in non- 
Africans (26.1 and 22.1 per diploid genome, re- 
spectively), consistent with the increased diversity 
of African populations and consistent with data 
observed for loss-of-function SNVs {{12, 39), 
-122 LoF SNVs in Africans versus -104 in non- 
Africans]. 

Whereas non-African individuals exhibited 
more homozygous deletion variants compared 
with Africans, among exon-intersecting deletions 
no such pattern was observed. Exon-intersecting 
duplications were much more balanced, with 
African populations showing only a slight excess 
when compared to non-Africans (98.4 versus 
95.2 events per genome). Studies of SNVs have 
not found consistent evidence of a difference in 
load between African compared to non-African 



populations (40-42). We compared the difference 
in load between African and non-African popula- 
tions for deletions and duplications, respectively. 
Here, we defined the difference in load as the 
difference in the sum of derived allele frequencies 
between African and non-African populations, 

L(Afr)-L(nAfr) = XI ^nAfrW where 

V^• V^• 

Pm{i) is the derived allele frequency of a variant i 
Prima facie Africans exhibited an apparent high- 
er deletion load than non-African populations 
(Fig. 5B) (P = 0.0003, block bootstrap test), al- 
though there was only a nominal difference in 
the load of exonic deletions (P = 0.0482). Dup- 
lications showed no such effect. 

We reasoned that this difference might poten- 
tially be driven by high-frequency-derived alleles, 
absent from the human reference genome, which 
was enriched for clone libraries of non-African 
ancestry (5). Approaches that rely on identifying 
CNVs based on read placements to the reference 
genome would necessarily miss these CNVs, de- 
creasing the number of variants identified in indi- 



viduals more closely resembling the reference, i.e., 
non-Africans. To test this hypothesis, we incorpo- 
rated the biallelic 571 nonrepetitive human CNV 
loci described above. Copy numbers were estimated 
for these sequences in each of the individuals 
and assessed by remapping raw reads against an 
ancestral human reference genome. As expected, 
the deletion allele of this sequence was at a high 
frequency (mean derived allele frequency, DAF = 
0.58). After including these sequences, we observed 
no difference in the CNV load between Africans 
and non-Africans (95% confidence interval -18.4 
to 8.8 load difference as defined above) (Fig. 5B), 
underscoring the importance of an unbiased human 
reference for such population genetic assessments. 

Although we found no CNV or SNV load differ- 
ences between populations, we examined wheth- 
er the relative proportion of base pairs differing 
among individuals derived from CNVs versus 
SNVs showed any population-specific trends. We 
calculated the number of base pairs varying be- 
tween all pairs of individuals assessed in our study 
contributed either from SNVs or from deletions. 
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calculating the DEL-bp/SNV-bp ratio. As expected, 
the number of base pairs differing between in- 
dividuals by deletions or by SNVs independently 
was always higher among African individuals 
when compared with other populations. Unex- 
pectedly, the ratio of deletion-bp to SNV-bp was 
substantially higher within non-African pop- 
ulations (mean of 1.27 compared to 1.14; Fig. 5, 
C and D). This relative increase in deleted base 
pairs was most pronounced among non-African 
populations, which have experienced more recent 
genetic bottlenecks (e.g., Siberian and Amerin- 
dian). Given the absence of a significant difference 
in the deletion load comparing African and non- 
African populations, there is no reason to believe 
that this finding is due to differences in the ef- 
fectiveness of selection against deletions since the 
populations separated. However, selection places 
a downward pressure on the allele frequencies of 
both deletions and SNVs, with the pressure being 
stronger for deletions because the selection coef- 
ficients are stronger on average. As has been pre- 
viously shown for SNVs, different allele-frequency 
spectra for deletions in contrast to SNVs have the 
potential to interact with the differences in demo- 
graphic history across populations— even without 
differences in the effectiveness of selection after 
population separation— to contribute to observed 
differences in the apportionment of genetic var- 
iation among human populations {41). 

Discussion 

Although the mutational properties and selective 
signatures of SNVs have been explored exten- 
sively, similar analyses of CNVs have lagged be- 
hind. As a class, duplications show generally poor 
correlations with SNV density, have poor linkage 
disequilibrium to SNVs {43, 44), and are less in- 
formative as phylogenetic markers but are more 
likely to be stratified than deletions among human 
populations. This observation may be explained 
by the fact that directly orientated duplications 
show a gradient of elevated mutation rates result- 
ing from nonallelic homologous recombination 
and, as such, can change their copy-number state 
more dynamically over short periods of time. This 
property also makes this class of variation, similar 
to highly mutable loci such as minisatellites (45), 
particularly susceptible to homoplasy— that is, 
identity by state as opposed to identity by descent. 
Deletions, in contrast, recapitulate most proper- 
ties of SNVs because they are more likely to 
exhibit identity by descent as a result of single 
ancestral mutation event. 

We have provided here sequencing data for 
the study of human diversity and used this re- 
source to explore patterns of human CNV diver- 
sity at a fine scale of resolution (>1 kbp). As 
expected, human genomes differ more with re- 
spect to CNVs than SNVs, and almost one-half 
of these CNV differences map to regions of seg- 
mental duplication. Both deletion and duplica- 
tion analyses consistently distinguish African, 
Oceanic, and Amerindian human populations. 
Africans show the greatest deletion and dupli- 
cation diversity and have the lowest rate of fixed 
deletions with respect to ancestral human in- 



sertion sequences. Oceanic and Amerindian, in 
contrast, show greater CNV differentiation, likely 
as a result of longer periods of genetic isolation 
and founder effects {46). Among the Oceanic, 
the Papuan-Bougainville group stands out in 
sharing more derived CNV alleles in common 
with Denisova, including a massive interspersed 
duplication that rose to high frequency over a 
short period of time. 

We find that duplications and deletions exhibit 
fundamentally different population-genetic prop- 
erties. Duplications are subjected to weaker se- 
lective constraint and are four times more likely 
to affect genes than deletions (Table 1), indicat- 
ing that they provide a larger target for adaptive 
selection. After controlling for reference genome 
biases, we find no difference in CNV load be- 
tween human populations when measured on 
a per-genome basis, which is what matters to 
disease risk, assuming that CNVs act additively. 
However, we find that the proportion of human 
variation that can be ascribed to CNVs rather 
than to SNVs is greater among non-Africans 
than among Africans. The biological significance 
of this difference should be interpreted cau- 
tiously and will require association studies to 
determine its relevance to disease and other 
phenotypic differences. 
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STRUCTURAL BIOLOGY 

Structure of a yeast spliceosome at 
3.6-angstrom resolution 

Chuangye Yan/* Jing Hang/* Ruixue Wan/* Min Huang/ 

Catherine C. L. Wong/ Yigong Shi^f 

Splicing of precursor messenger RNA (pre-mRNA) in yeast is executed by the spliceosome, 
which consists of five small nuclear ribonucleoproteins (snRNPs), NTC (nineteen complex), 
NTC-related proteins (NTR), and a number of associated enzymes and cofactors. Here, we 
report the three-dimensional structure of a Schizosaccharomyces pombe spliceosome 
at 3.6-angstrom resolution, revealed by means of single-particle cryogenic electron 
microscopy. This spliceosome contains U2 and U5 snRNPs, NTC, NTR, U6 small nuclear 
RNA, and an RNA intron lariat. The atomic model includes 10,574 amino acids from 
37 proteins and four RNA molecules, with a combined molecular mass of approximately 
1.3 megadaltons. Spp42 (Prp8 in Saccharomyces cerevisiae), the key protein component 
of the U5 snRNP, forms a central scaffold and anchors the catalytic center. Both the 
morphology and the placement of protein components appear to have evolved to facilitate 
the dynamic process of pre-mRNA splicing. Our near-atomic-resolution structure of a 
central spliceosome provides a molecular framework for mechanistic understanding of 
pre-mRNA splicing. 



P recursor messenger RNA (pre-mRNA) splic- 
ing, which involves the removal of the 
noncoding introns and the ligation of 
neighboring exons, is a defining feature 
for all eukaryotes (i). Aberrant splicing 
contributes to numerous debilitating diseases 
(2). The yeast spliceosome is a multi-megadalton 
ribonucleoprotein (RNP) complex and consists 
of Ul, U2, U4, U5, and U6 small nuclear RNPs 
(snRNPs), NTC (nineteen complex; known as 
the Prpl9-CDC5L complex in mammals), NTC- 
related proteins (NTR), and numerous other en- 
zymes and cofactors (3, 4). The spliceosome 
exhibits exceptional compositional dynamics and 
conformational flexibility, which is consistent 
with its function of splicing introns with diverse 
sequences (5-7). 

At the beginning of a splicing reaction cycle, 
the 5'-splice site (5'SS) of an intron is recognized 
by Ul snRNP, and the branch point sequence 
(BPS) and 3'SS are bound by U2 snRNP, forming 
the spliceosomal A complex. Recruitment of the 
U4/U6.U5 tri-snRNP leads to assembly of the pre- 
catalytic B complex. RNP rearrangement causes 
displacement of Ul and U4 snRNPs and recruit- 
ment of NTC and NTR, generating the activated 
B complex (B^). The B^ spliceosome is converted 
into the catalytically competent B* * complex. 
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which catalyzes the first step of trans-esterification. 
The end product is the C complex, also known as 
the catalytic step I spliceosome, which contains 
the cleaved 5' -exon and an intron lariat-3 -exon 
intermediate (hereafter, intron lariat). The sec- 
ond step of trans-esterification results in the 
ligation of two exons, forming the post-catalytic 
P complex. Subsequently, the ligated exon is re- 
leased, but the intron lariat remains associated 
with the intron-lariat spliceosomal (ILS) complex. 
Last, the intron lariat is dissociated, and the 
snRNPs, NTC, and NTR are recycled. The RNP 
rearrangements are driven by eight evolution- 
arily conserved DExD/H-type, RNA-dependent 
adenosine triphosphatases (ATPases)/helicases 
(8, 9). 

Biochemical studies during the past few decades 
have provided valuable insights into the mecha- 
nism of pre-mRNA splicing by the spliceosome. 
Analysis of thioester substitutions in pre-mRNA 
identified the spliceosome as a metalloenzyme 
{10-12), with the intramolecular stem loop of U6 
small nuclear RNA (snRNA) coordinating the 
catalytic magnesium ions {13, 14). The spliceosom- 
al active site is centered around loop I of U5 
snRNA, the U2/U6 duplex, and a catalytic cavity 
in Prp8 {IS). Structural investigations of the 
spliceosomal components— exemplified by the 
crystal structures of Prp8 {16), Brr2 {17, 18), and 
the Sm and like-Sm (Lsm) rings (iP-23)— have 
also improved our understanding of pre-mRNA 
splicing. 

Elucidating the molecular mechanisms of the 
spliceosome and the splicing reaction requires 
detailed structural information on the intact 
spliceosome at different stages of its action. The 
large size of the spliceosome and its extraordi- 



nary conformational and compositional diversity 
have made this task a challenge for structural 
biologists {24). These features contrast with the 
other central RNP complex, the ribosome, which 
contains two relatively stable subunits that are 
more amenable to structural investigation {25-27). 
RNA accounts for more than 50% of the total 
molecular mass in the bacterial or mammalian 
ribosome, but less than 10% in the spliceosome. 
The constant flux of protein factors during each 
cycle of the splicing reaction has prevented crys- 
tallization of the spliceosome. 

In the past decade, electron microscopy (EM) 
has been used to visualize various spliceosomal 
complexes, yielding a series of structures at re- 
solutions ranging from 20 to 50 A {28-44). Most 
recently, the cryogenic EM (cryo-EM) structure 
of the spliceosomal U4/U6.U5 tri-snRNP from 
Saccharomyces cerevisiae was determined at 5.9 A 
resolution (45). Here, we report the cryo-EM 
structure of a yeast spliceosome at 3.6 A resolu- 
tion. We describe the overall structure of the 
spliceosome and its protein components in this 
Research Article and discuss the structural in- 
sights into snRNA recognition and pre-mRNA 
catalysis in {46). 

Spliceosome isolation 
and characterization 

We sought to purify the spliceosomal C complex 
because of its central role of bridging the two 
steps of a splicing reaction and catalyzing the sec- 
ond step. We modified a published purification 
protocol for the C complex {36, 47) and obtained 
approximately 500 pg of spliceosome from 4 liters 
of Schizosaccharomyces pombe culture (fig. SIA 
and supplementary materials, materials and meth- 
ods). The purified yeast spliceosome exhibited 
excellent solution behavior, as judged with gel fil- 
tration analysis (fig. SIB). The spliceosome con- 
tains three major RNA species— whose lengths are 
consistent with those of U2, U5, and U6 snRNAs 
from S. pombe— dndi a diverse array of RNA mol- 
ecules with varying lengths, which probably in- 
cludes the intron lariat (fig. SIC). In addition, the 
spliceosome contains a large number of protein 
components (fig. SID). 

To examine the identity of the spliceosome 
and to facilitate future structure assignment, we 
analyzed the sample using mass spectroscopy (MS). 
This approach detected ~80 spliceosomal proteins, 
with ~50 of these in high abundance (figs. SIE and 
S2). Most of these abundant spliceosomal pro- 
teins were components of U2 snRNP, U5 snRNP, 
NTC, and NTR (fig. SIE). These proteins are shared 
among the spliceosomal B^, B*, C, P, and ILS com- 
plexes. To differentiate among these complexes, 
we performed reverse transcription polymerase 
chain reactions (RT-PCR) on the cut6 gene using 
the purified spliceosome (fig. S3). The intron lariat 
was present in the spliceosome, suggesting com- 
pletion of the first-step reaction. However, the un- 
spliced cut6 gene and the ligated exon were also 
detected, suggesting the presence of the B*^ and/or 
B* and P complexes, respectively (fig. S3). We con- 
clude that the purified spliceosome contains a 
mixture of the different complexes. Last, after 
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chemical crosslinking, we uncovered by means 
of MS analysis 78 intermolecular interactions 
among the spliceosomal proteins (fig. S4), which 
proved important for structure assignment. 

Structure determination 

We first generated a low-resolution reconstruc- 
tion of the yeast spliceosome by using negative 
staining (fig. S5). Then, we imaged the sample 
under cryogenic conditions with a K2 direct 
electron detector, mounted on a Titan Krios mi- 
croscope operating at 300 kV. A total of 2246 
micrographs were collected (Fig. lA and table SI), 
and we subjected 224,450 particles, picked semi- 
automatically, to particle sorting and reference- 
free two-dimensional (2D) classification (Fig. IB 
and figs. S6 and S7A). We performed 3D classi- 
fication for 133,901 particles. The vast majority 
(112,795) of these particles produced a density 
map at an overall resolution of 3.9 A, which was 
further improved by means of particle polishing 
to 3.6 A on the basis of the gold-standard Fourier 
Shell Correlation (FSC) criteria (Fig. 1C). The actual 
resolution within the spliceosome ranged from 
2.9 to 3.6 A in the core region to 7 to 8 A in the 
periphery (Figs. ID, 2, and 3A). The quality and 
resolution of EM density for different regions 
of the spliceosome were improved by applying 
individual local masks (figs. S7B, S8, and S9). 



Throughout the spliceosome, most secondary 
structural elements were clearly visible, and a 
large proportion of the amino acid side chains 
were well defined (figs. SIO to S17). 

The spliceosome has an extended and asym- 
metric morphology, with the longest dimension 
exceeding 300 A (Fig. 2). The bulk of the well- 
defined density can be attributed to a triangular 
Central Body (Figs. ID and 2), which is connected 
to a Head group and two Arms (named I and II). 
The Head and Arm II in our structure appear to 
correspond to the “head-like” and “ridge” do- 
mains in the 29 A cryo-EM structure of the S. 
pornbe U5.U2/U6 complex {36). An elongated 
tubular EM density in Arm II, connected to a 
donut-shaped density, is visible at 20 A resolu- 
tion (Fig. 2A); at higher resolutions, this density 
can be assigned as a tetrameric assembly of Cwf8/ 
Prpl9, Cdc5 C-terminal fragment, and Cwf7. At 
10 and 5 A resolutions (Fig. 2, B and C), these 
structural features become more prominent, al- 
lowing identification of most secondary struc- 
tural elements. 

Identification of protein and 
RNA components 

We combined de novo atomic model building 
and homologous structure modeling to gener- 
ate an atomic model for the entire S. pornbe 
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Fig. 1. Ciyo-EM analysis of a yeast spliceosome. (A) A representative cryo-EM micrograph of the 
yeast spliceosomal complex. An entire micrograph is shown. Scale bar, 30 nm. (B) Representative 2D 
class averages of the yeast spliceosome. (C) The overall resolution is estimated to be 3.6 A on the basis 
of the gold-standard FSC criteria of 0.143. (D) An overall view of the EM density for the yeast 
spliceosome. The resolution varies among different regions of the yeast spliceosome, as indicated by 
the different colors. The surface view of the spliceosome is shown here. The resolution reaches 2.9 to 
3.2 A in the center of the spliceosome. 



spliceosome (Fig. 3 and table S2). On the basis 
of the EM density, we identified the heptameric 
Sm ring and the WD40-repeat proteins Cwfl/ 
Prp5, Cwf8/Prpl9, and Cwfl7 (fig. S18). The large 
proteins Spp42 (Prp8 in S. cerevisiae), CwflO 
(Snull4 in S. cerevisiae), and Cwfll, each with an 
available homologous structure {16, 48, 49), were 
docked into the EM density. Model building of 
Spp42 allowed assignment of U5 snRNA, which 
is mostly bound by the N-terminal 800 amino 
acids of Spp42. These procedures were followed 
by identification of the tetrameric assembly of 
Cwf8/Prpl9, which resides in Arm II of the splice- 
osome; the superhelical proteins Cwf3/Syfl and 
Cwf4/Syf3, which connect the Central Body to the 
Head and Arm I; and several other proteins, 
including Cdc5, Cwf2/Prp3, Cwfl4, Prp45, Cwfl9, 
Cwf5/Ecm2, and Prpl7 (fig. S18). The EM density 
for Arm I is weak. A local mask refinement after 
two rounds of 3D classification generated an 
improved map and enabled subsequent identifi- 
cation of Leal, Msll, a portion of U2 snRNA, 
and the Sm ring for U2 snRNA (fig. S18). In 
addition to EM density, assignment of the U2 
and U6 snRNAs was facilitated by the location 
of known interacting proteins, predicted sec- 
ondary structures, and published base-pairing 
specifics (table S2). After identification of U2 and 
U6 snRNAs, the RNA lariat was located, and the 
BPS and 5'SS were tentatively assigned. Last, on 
the basis of EM density and MS identification of 
crosslinked proteins (fig. S4), we assigned Cwf7, 
Cwfl5, and Cypl. 

Overall structure 

The final refined atomic model of the yeast 
spliceosome contains 10,574 amino acids from 37 
proteins, three snRNA molecules, and an intron 
lariat (Fig. 3B and tables SI and S2), with a com- 
bined molecular mass of -1.3 MD. Among the 
modeled amino acids, 9312 were assigned spe- 
cific side chains, and the remaining 1262 residues 
were built into a poly(Ala) model. U2, U5, and U6 
snRNAs contain a total of 405 nucleotides, of 
which 314 were tentatively assigned in our atomic 
model. We also modeled 18 nucleotides from the 
intron lariat. Some areas of very weak EM den- 
sity, probably reflecting dynamic components of 
the spliceosome, remain unassigned. The protein 
components in the atomic model include all 10 
core proteins of U5 snRNP (except Brr2), all nine 
core proteins of U2 snRNP, eight of the nine core 
proteins of NTC, and five of the eight core pro- 
teins of NTK Nearly complete atomic models are 
now available for Spp42 (residues 47 to 2030), 
which, as a central component of the U5 snRNP, 
anchors the catalytic center of the spliceosome, 
and for CwflO (residues 68 to 971), which, as the 
only guanosine triphosphatase (GTPase) among 
the spliceosomal components, regulates the splic- 
ing reaction {50). 

The U2 snRNP— comprising Leal, Msll, the 
heptameric Sm ring, and U2 snRNA— constitutes 
Arm I of the spliceosome. Arm I is linked to the 
Central Body through two associations: one with 
the superhelical proteins Cwf3/Syfl and Cwf4/ 
Syf3 and another through U2 snRNA, which 
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enters the catalytic center (Fig. 3B). Arm II of the 
spliceosome consists of an elongated tetramer- 
ic assembly of Cwf8/Prpl9 that is wrapped by 
Cwf7 and Cdc5 in their extended conformations 
(Fig. 3B). Cdc5, in turn, directly contacts Cwf3/ 
Syfl, Cwf4/Syf3, the NTC component Cwf7, and 
the U5 snRNP protein Cwfl7. The Head region 
comprises mainly Cwfll and is connected to the 
triangular Central Body through the superhelical 
proteins Cwf3/Syfl and Cwf4/Syfi3. The base of the 
Central Body is mainly composed of U5 snRNP 
(Fig. 3B). All other proteins are located at the 
center of the Central Body, making direct or in- 
direct contacts with the RNA molecules at the 
catalytic center. 



The catalytic center, identified by the U2/U6 
snRNA triplex, is located at the heart of the 
Central Body (Fig. 3B). The catalytic center is 
more than 100 A away from the Head region, 
either Arm, and the corner of the Central Body 
(as defined by the 3' end of U5 snRNA). The dis- 
tance between the tip of the Head region and the 
far corner of the Central Body measures -335 A, 
whereas the two Arms are separated by a dis- 
tance of up to 320 A (Fig. 3B). The Head and two 
Arms of the spliceosome are linked to the Central 
Body through limited contacts, which may engen- 
der conformational flexibility. Both the large size 
and the extended organization of the spliceosome 
are likely to be functionally important for proper 



splicing of pre-mRNAs with varying lengths and 
sequences. 

Structure of Spp42 

At 270 kD, Spp42 is the largest and most con- 
served component of all spliceosomal proteins 
{51), displaying 63 and 73% sequence identity 
with its functional ortholog Prp8 in S. cerevisiae 
and humans, respectively. As a major compo- 
nent of U5 snRNP, Spp42 serves as a central 
scaffold for pre-mRNA splicing. Except for the 
N-terminal 46 amino acids and the C-terminal 
Jabl/MPN domain, most of the Spp42 sequen- 
ces have a well-defined EM density (fig. SIO, A 
to H, and fig. SllA). The structure of Spp42 



A “20 A resolution 





Fig. 2. Structural features of a yeast spliceosome at different resolution 
limits. (A) The EM density map of the yeast spliceosome at an averaged 
resolution of 20 A. The spliceosome contains a triangular Central Body, a Head, 
and two Arms (I and II). The four perpendicular views illustrate the overall 
structural features. The same four views are shown in (B) and (C) for com- 
parison. (B) The EM density map of the yeast spliceosome at an averaged 



resolution of 10 A. The improved resolution of the structural features allows 
docking of homologous structures and assignment of many protein compo- 
nents. (C) The EM density map of the yeast spliceosome at an averaged res- 
olution of 5 A. Most secondary structural elements of the protein components 
can be assigned at this resolution. Connectivity between the adjacent second- 
ary structural elements is mostly clear. 
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includes the following domains: N-terminal (res- 
idues 47 to 825), reverse transcriptase (RT) Palm/ 
Finger (residues 826 to 1210), Thumb/X (resi- 
dues 1211 to 1327), Linker (residues 1328 to 1602), 
endonuclease-like (residues 1603 to 1783), and 
ribonuclease (RNase) H-like (residues 1784 to 
2030) (Fig. 4A). 

Spp42 looks like a thick triangular plate, with 
the three sides measuring 175, 150, and 125 A and 
a thickness of ~80 A (Fig. 4A). The N-terminal 
domain of Spp42, the structure of which has not 
previously been described, binds the GTPase CwflO 
and recognizes the bulk of U5 snRNA (52, 53). In 
the structure, the N-terminal domain adopts an 
extended conformation, with three protruding 



structural elements reaching out to contact CwflO, 
Cwfl7, and the RT Palm/Finger domain of Spp42 
(Fig. 4B). Spp42 can be superimposed onto Prp8 
{16) with a root mean square deviation (RMSD) 
of 1.46 A over 859 aligned Ca atoms in the RT 
Palm/Finger, Thumb/X, Linker, and endonuclease- 
like domains (Fig. 4, C and D). These four con- 
tiguous domains constitute a conserved structural 
scaffold between Spp42 and Prp8. 

The reported Prp8 structure contains a C- 
terminal Jabl/MPN domain (Fig. 4C) {16). Place- 
ment of the Jabl/MPN domain depends on the 
U5 snRNP assembly factor Aar2, which contrib- 
utes a pairing p strand between the RNase H- 
like and Jabl/MPN domains {16). Superposition 



of Spp42 and Prp8 revealed marked differences 
between the RNase H-like domains (Fig. 4D). 
Compared with Prp8, the RNase H-like domain 
in Spp42 undergoes a rotation of -25° away from 
the RT Palm/Finger domain, resulting in a trans- 
lation of up to 28 A for its protruding p hairpin 
(Fig. 4E). The space vacated by the RNase H-like 
domain is in turn occupied by Cwfl9, which inter- 
acts closely with both the RNase H-like and the 
RT Palm/Finger domains (Fig. 4A). The Jabl/MPN 
domain, responsible for binding the ATPase/ 
helicase Brr2 {17, 18), is flexible and disordered in 
our structure. Consequently, despite its presence 
in the spliceosome (fig. SIE), Brr2 has no visible 
EM density, probably reflecting its highly dynamic 
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Fig. 3. Structure of a 
yeast spliceosome 
at 3.6 A resolution. 

(A) The EM density 
map of the yeast spli- 
ceosome at an overall 
resolution of 3.6 A. 
Four perpendicular 
views around the x 
axis and two addi- 
tional views around 
the y axis show the 
color-coded protein 
and RNA compo- 
nents. (B) A repre- 
sentative illustration 
of the yeast spliceo- 
some from two per- 
pendicular views. The 
protein and RNA 
components are 
color-coded. This 
structure includes 37 
proteins, three 
snRNAs, and one RNA 
lariat, with a com- 
bined molecular 
weight of -1.3 MD. 
Among the modeled 
10,574 amino acids, 
9312 have been 
assigned side chains. 
All 332 RNA nucleo- 
tides were tentatively 
assigned. 
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State. This structural arrangement would place 
Brr2 in close proximity to the catalytic center and 
allow ample conformational freedom for Brr2 to 
apply its ATP-dependent RNP remodeling activ- 
ity in both steps of the splicing reaction. 

Catalytic cavity of the spliceosome 

Spp42 plays a key role in pre-mRNA splicing. 
Splicing defects as a result of splice-site mutation 



can be suppressed by rescuing mutations in Prp8 
(54-56). Based on examination of these mutation- 
targeted residues, the presence of a catalytic cav- 
ity on Prp8, formed mainly by the Linker and 
RNase H-like domains, has been proposed (16). 
Analysis of the electrostatic surface potential of 
Spp42 revealed a striking cavity that is highly en- 
riched by positively charged amino acids (Fig. 4F) 
but differs from the proposed catalytic cavity 



(16). Instead, this cavity is formed between the N- 
terminal domain and the Thumb/X-Linker re- 
gion of Spp42 and hence can only be recognized 
in the full-length Spp42 protein. The RNA triplex 
of U2 and U6 snRNAs and the intron lariat are 
located in this cavity (Fig. 4G), making close con- 
tact with the positively charged amino acids. The 
positively charged residues in this catalytic cavity 
are invariant among S. pomhe, S. cerevisiae, and 



Fig. 4. Structure of 
the central protein 
component Spp42 
(Prp8 in S. 
cerevisiae). (A) The 
overall structure of 
Spp42. The structure 
contains the following 
domains: N-terminal 
(green, residues 47 to 
825), RT Palm/Finger 
(blue, residues 826 to 
1210), Thumb/X 
(cyan, residues 1211 to 
1327), Linker (pale 
green, residues 1328 to 
1602), endonuclease- 
like (pink, residues 
1603 to 1783), and 
RNase H-like (purple, 
residues 1784 to 2030). 
The Jabl/MPN 
domain (residues 
2044 to 2363) is 
disordered, and its 
approximate position is 
occupied by Cwfl9 
(red). (B) The struc- 
ture of the N-terminal 
domain of Spp42. The 
N-terminal helix is 
responsible for binding 
to Cwfl7 and the lasso 
on the right corner 
interacts with CwflO. 

(C) The structure of 
Prp8 (residues 885 to 
2413) from S. cerevisiae 
(16). The RNase 
H-like and Jabl/MPN 
domains are colored 
light green and gold, 
respectively. (D) Struc- 
tural comparison 
between Spp42 and 
Prp8. The structures 
from the Palm/Finger 
domain to the 
endonuclease-like 
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domain are very similar. 

However, the RNase H-like domain exhibits a major conformational shift. (E) A 
close-up view of the RNase H-like domains of Spp42 and Prp8. The RNase H-like 
domain of Spp42 can be aligned to its counterpart in Prp8 by a translation of 
~20 A and a clockwise rotation of 25° around an axis perpendicular to the page. 
(F) Analysis of the electrostatic surface potential of Spp42 reveals a putative 
catalytic center. A central hole at the interface between the N-terminal domain 



and the Thumb/X and Linker domains is enriched by positively charged amino 
acids and probably represents the catalytic center (left). In contrast, the 
previously identified catalytic cavity lacks such positive potential (bottom 
right). Another region enriched by positive electrostatic potential (top right) 
represents the binding site for U5 snRNA. (G) U2 snRNA, U6 snRNA, and 
the intron lariat are bound at the catalytic center of Spp42. 
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Fig. 5. Protein components in U5 snRNP. (A) Structure of the EF-2-family GTPase CwflO. The bound 
GDP molecule is shown in stick representation. The extended N terminus binds Spp42. (B) An overall 
view of the interactions between GwflO and Spp42. The N-terminal sequences of GwflO recognize the 
Palm/Finger domain of Spp42, whereas the N-terminal domain of Spp42 forms a lasso over the back 
side of the nucleotide-binding domain of GwflO. (C) A close-up view of the interface between the N- 
terminal sequences of GwflO and a shallow surface groove on the Palm/Finger domain of Spp42. (D) A 
close-up view of the Spp42 lasso over the N-terminal domain of GwflO. In particular, three phenylalanine 
residues from Spp42 make a number of van der Waals contacts to hydrophobic amino acids on the 
surface of GwflO. (E) Gwfl7 and its interactions with Spp42 and SmBl. The N-terminal a-helix from 
Spp42 binds the bottom face of the Gwfl7 WD40 propeller, whereas the G-terminal sequences of 
SmBl interact with the outer surface of the propeller. Detailed interactions are shown in the two insets. 
(F) Structure of the heptameric Sm ring. The Sm ring directly interacts with the N-terminal domain of 
GwflO (blue in the background). Detailed interactions are shown in the inset. The Sm ring recognizes a 
stretch of U-rich RNA sequences at the 3' end of U5 snRNA. 



humans (fig. S19A), allowing modeling of a sim- 
ilar catalytic cavity in Prp8 (fig. S19B). 

Other protein components in U5 snRNP 

A major component of the U5 snRNP, CwflO 
(Snull4 in S. cerevisiae and U5-116K in humans), 
is a translation elongation factor 2 (EF-2)-like 
GTPase that regulates the RNP remodeling ac- 
tivity of Brr2 (57). The excellent EM density for 
CwflO allows atomic modeling of nearly the en- 
tire length of the protein and of a bound gua- 
nosine diphosphate (GDP) molecule (Fig. 5A and 
fig. Sll, B and C). CwflO closely interacts with 
Spp42 through two discrete, reciprocal interfaces 
(Fig. 5B). Residues from the extended N-terminal 
sequences of CwflO bind to a surface groove on 
Spp42, making specific interactions with amino 
acids in the groove (Fig. 5C). In addition, an ex- 
tended loop in the N-terminal domain of Spp42 
forms a lasso over the nucleotide-binding domain 
of CwflO (Fig. 5D). In particular, three phenyl- 
alanine residues from the lasso (Phe^^^, Phe^^®, 
and Phe^^^) are nestled in a greasy surface pocket 
formed by hydrophobic amino acids in CwflO 
(Fig. 5D). 

Cwfl7/Spf38 (U5-40K in humans) is a WD40- 
repeat protein that is a core component of U5 
snRNP. The functional ortholog of Cwfl7/Spf38 
in S. cerevisiae is yet to be identified. In the struc- 
ture, an N-terminal a-helix of Spp42 associates 
with the bottom face of the WD40 repeats through 
specific hydrogen bonds and van der Waals con- 
tacts (Fig. 5E). The last core component of the U5 
snRNP is the heptameric Sm ring, which orients 
the 3' end sequences of U5 snRNA and specifical- 
ly interacts with the N-terminal domain of CwflO 
(Fig. 5F). The extended C-terminal sequences of 
SmBl reach out to contact the outer surface of the 
Cwfl7 P propeller, forming a number of hydrogen 
bonds (Fig. 5E). 

Protein components in NTC and NTR 

A shared feature of the spliceosomal proteins is 
their extended architecture. This is best illus- 
trated by the tetrameric assembly of Cwf8/Prpl9, 
which measures up to 175 A in length (Fig. 6A). 
Two molecules of Cwf8/Prpl9 form an inter- 
twined dimer through their elongated coiled-coil 
region, with the two U-boxes distant from each 
other. Two such Cwf8/Prpl9 dimers use their re- 
spective U-boxes to further associate with each 
other (Fig. 6A). This structural arrangement may 
allow considerable conformational flexibility and 
markedly increase surface areas for potential in- 
teractions with other spliceosomal factors. 

The extended architecture is also exemplified 
by Cwf3/Syfl and Cwf4/Syf3. Both of these are 
superhelical proteins, containing 19 and 16 half- 
a-tetratricopeptide (HAT) repeats {58), respec- 
tively, and exhibiting a t\visted C-shaped morphol- 
ogy (Fig. 6, B and C). The maximal intramolecular 
distance for these proteins exceeds 150 A. The 
middle portion of the convex side of Cwf3/Syfl 
associates with the central region of the concave 
surface of Cwf4/Syf3 (Fig. 6D). HAT-repeat pro- 
teins exhibit conformational plasticity (58); the 
presence of Cwf3/Syfl and Cwf4/Syf3 in the 
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spliceosome probably ensures adequate confor- 
mational freedom for the splicing reactions and 
exchange of cofactors. Cwf8/Prpl9, Cwf3/Syfl, 
and Cwf4/Syf3 are all core components of NTC, 



supporting an important regulatory role for this 
complex. 

Prp45, a core component of NTR, has been 
predicted to be intrinsically disordered {59). Our 



structure confirms the prediction: Prp45 only 
contains extended secondary structural elements 
that span a distance of over 150 A (Fig. 6E). Prp45 
appears to promote spliceosome assembly by di- 
rectly interacting with, and thus linking together, 
at least nine distinct proteins, including Spp42, 
Cwfl7, and Cwf4/Syf3 (Fig. 6F and fig. S20). 
Prp45 also directly interacts with U2 and U6 
snRNAs at the catalytic center. Such an arrange- 
ment allows the conformational changes at the 
catalytic center to be propagated to any distant 
comer of the spliceosome through the connection 
among Prp45, Cwf4/Syf3, and Cwf3/Syfl. Thus, 
similar to Cwf8/Prpl9, Cwf3/Syfl, and Cwf4/Syf3, 
the morphology and the placement of Prp45 
seem to have evolved to facilitate the dynamic 
process of pre-mRNA splicing. 

Our stmcture includes a number of other NTC 
and NTR components. These include five NTC 
core components: (i) Cdc5, which plays an impor- 
tant role in pre-mRNA splicing by stabilizing the 
second-step spliceosome {60-62); (ii) Cwf7, which 
modulates interactions of Prpl9 with other asso- 
ciated cofactors {63, 64); (iii) Cwf2/Prp3, which 
helps link NTC to the catalytic center; (iv) Cwfl/ 
Prp5, which contains a 7-bladed p propeller; and 
(v) Cwfl5, which interacts with U5 snRNA Pip5, 
and Spp42. In addition, we modeled four NTR 
components: (i) Cwf5/Ecm2, which is involved in 
base-pairing interactions of U2/U6 helix II {65); 
(ii) Cwfll, which is an armadillo domain contain- 
ing RNA helicase {49); (iii) Cwfl4, which facili- 
tates both steps of the splicing reaction {66); and 
(iv) Cwfl9, the human ortholog of which is thought 
to be involved in the development of recessive 
ataxia syndrome {67). We also identified Prpl7 and 
the prolyl isomerase Cypl in the EM density map. 
Stmctural mapping of these proteins in the splice- 
osome allows critical assessment of their functions 
in pre-mRNA splicing (Fig. 6, G to K, and fig. S21). 

Assembly of snRNPs and NTC 

The atomic model of the yeast spliceosome main- 
ly comprises four subcomplexes— U2 snRNP, U5 
snRNP, NTC, and NTR— which are intertwined 
with one another to form an asymmetric assem- 
bly (Fig. 7A). The U2/U6 duplex and the intron 
lariat are partially exposed on the surface of the 
spliceosome and accessible to the modifying en- 
zymes, such as the ATPase/helicase Brr2. The U5 
snRNP, located at the bottom of the triangular 
Central Body, serves as the base of the entire 
spliceosome (Fig. 7B). At the heart of the Central 
Body is an interconnected RNA assembly involv- 
ing U2 snRNA U6 snRNA and the intron lariat. 
Loop I of U5 snRNA is located close to the U2/U6 
duplex (Fig. 7B). NTC and NTR together form a 
supporting network above the U5 snRNP and 
the centrally located RNA molecules. 

The NTC is characterized by a large and ex- 
tended architecture, 200 A in height and width 
and 170 A in thickness (Fig. 8A). The eight iden- 
tified protein components in NTC are mostly in- 
terconnected, constituting a large scaffold. In 
contrast, the five protein components of NTR 
are mostly unconnected (Fig. 8B) and only come 
together through interactions with the NTC 




Fig. 6. Representative structures of individual protein components. (A) The tetrameric assembly of 
Cwf8/Prpl9. The U-box dimer is highlighted in the inset. (B) The structure of the HAT-repeat superhelical 
protein Cwf3/Syfl. (C) The structure of the HAT-repeat superhelical protein Cwf4/Syf3. (D) Cwf3/Syfl 
directly interacts with Cwf4/Syf3, both using HAT repeats in the middle portions of their respective 
structures. (E) The structure of Prp45. The 216 amino acids in Prp45 span 156 A. (F) Prp45 interacts 
with at least nine protein components and two snRNAs. Prp45 is shown in surface view. (G) The struc- 
ture of the WD40-repeat protein Cwfl/Prp5. (H) The structure of the NTR component Cwfl4. Cwfl4, 
which is thought to facilitate both steps of the splicing reaction, contains three zinc ions (blue spheres). 
(I) Structure of the functionally unknown NTR component Cwfl9. (J) The NTR component Cwf5/Ecm2, 
which is thought to be involved in base-pairing interactions of U2/U6 helix II, exhibits an extended 
structure. (K) The structure of the NTC component Cdc5, which is thought to serve as a scaffold to 
recruit other protein factors. Of these proteins, no relevant structural information was previously 
available for Cwfl9, Prp45, the N-terminal region of Cwf5, the N-terminal region of Spp42, or the C- 
terminal region of Cdc5. The other proteins have homologous structures, but this is the first time that 
they have been reported in S. pombe. 
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Fig. 7. The four multicomponent subcomplexes U5 snRNP, U2 snRNP, NTC, 
and NTR closely associate with one another to form the yeast spliceosome. 

(A) U5 snRNP forms a scaffold onto which the other three subcomplexes dock 
around the RNA molecules and the catalytic center. The protein components 
within the same snRNP are colored identically: yellow for U5 snRNR cyan for U2 



snRNR light pink for NTC, and brown for NTR. The four RNA molecules are in- 
dividually colored. (B) Relative positions of the four multicomponent subcomplexes 
and the RNA molecules within the spliceosome. The protein components within 
each subcomplex are differentially color-coded. These five views of the yeast 
spliceosome have the same general orientation as that of the first view in (A). 



components (Fig. 8C). Together, NTC and NTR 
define a large central space, part of which is 
occupied by the catalytic center (Fig. 8C). 

Dynamic conformation of the 
spliceosome 

Most of the spliceosomal particles display a well- 
defined conformation, allowing us to obtain a 3D 
structure at an overall resolution of 3.6 A. The 
high-resolution regions are largely restricted to 
the Central Body, which appears to have a rigid 
conformation. The peripheral regions occupied 
by components of NTC and NTR exhibit consid- 
erable conformational flexibility, which poses a 
challenge to any effort aimed at improving the 
local resolution. In addition to local conforma- 
tional variations, large-scale structural changes 
are also associated with the spliceosome. We were 
able to obtain two conformational states, with 
37,622 and 49,079 particles, through an addition- 
al round of 3D classification. We low-pass filtered 
both maps to 10 A and compared the conforma- 
tional changes (fig. S22A). Superposition of these 
two structures revealed a major movement of the 
Head region toward Arm I (fig. S22B). As a con- 
sequence of this movement, much of the already 
flexible U2 snRNP becomes disordered. 

Discussion 

Structure determination by means of x-ray crys- 
tallography has been focused on the individual 
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components or subcomplexes of the spliceosome, 
yielding valuable information on a number of im- 
portant proteins— namely, the structures of the 
core components of U1 snRNP (21-23), U2 snRNP 
(68-71), and U4 snRNP (19)\ two subcomplexes 
of the U6 snRNP (20, 72)\ a large fragment of 
Prp8 (16)\ and Brr2 (17, 18). These structures pro- 
vide individual pieces of the spliceosome jigsaw 
puzzle. In the past decade, EM-based studies 
have produced structural insights into the intact 
spliceosome at various stages of the splicing re- 
action (73). These studies were performed on the 
mammalian spliceosomal A complex (29), B com- 
plex (31, 38, 40), B* complex (30), C complex 
(33, 35), and P complex (42), as well as on the 
yeast B complex (32), B^ complex (32), the U5. 
U2/U6 complex or C complex (32, 36), and the 
ILS complex (43). The highest resolution reported 
in these EM studies is 20 to 29 A, which only allows 
description of general features of the spliceosome. 
After submission of this Research Article, the 
cryo-EM structure of the spliceosomal U4/U6. 
U5 tri-snRNP in S. cerevisiae was reported at 5.9 
A resolution (45), which allows domain identifi- 
cation and secondary-structure assignment for 
some components. The U4/U6.U5 tri-snRNP is 
an important complex for the assembly of the 
functional spliceosomes. 

Here, we report the cryo-EM structure of an 
intact S. powbe spliceosome at a near-atomic res- 
olution of 3.6 A. The resolution exceeds 3.2 A in 



the core region of the spliceosome, which covers 
the bulk of the U5 snRNP and a number of other 
proteins from NTC and NTR. On the basis of the 
EM density, we have generated the first atomic 
model of an intact, functional spliceosome. We 
have provided an overall and preliminary anal- 
ysis of the spliceosome structure. The enormous 
amount of information contained therein requires 
additional analysis. There are over 100 discrete 
protein-protein and protein-RNA interfaces among 
the 37 proteins and four RNA molecules of the 
spliceosome, involving a substantial amount of 
buried surface area. There are numerous other 
details that warrant attention; for example, Cwf5, 
Cwfl4, and Cwfl9 contain three different zinc- 
binding motifs (fig. S23). Determination of a 
near-atomic-resolution structure of an intact 
spliceosome represents a major milestone. Insights 
gained from structural analysis will greatly im- 
prove the mechanistic understanding of pre- 
mRNA splicing. 

Although we intended to purify the spliceosomal 
C complex, the purified sample included a mix- 
ture of different spliceosomes, as judged by use 
of RT-PCR (fig. S3). Because Cdc5, which is re- 
cruited into the spliceosomal B^ complex, was 
used as the affinity tag for purification, the puri- 
fied sample should be restricted to the B^, B*, C, 
P, and ILS complexes. This finding is corroborated 
by the preponderance of NTC and NTR compo- 
nents (which are also recruited into the B^^ 
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Fig. 8. Structures of the 
NTC and NTR. (A) The over- 
all structure of the NTC. The 
color-coded NTC compo- 
nents are mostly intercon- 
nected through direct 
interactions. (B) Overall 
structure of the NTR. In con- 
trast to NTC, components of 
the NTR make no direct 
contacts with each other and 
are mostly unconnected. 

(C) NTC and NTR surround 
the catalytic center. The 
catalytic center is identified 
by three intertwined RNA 
molecules: U6 snRNA 
(green), U2 snRNA (blue), 
and the intron lariat 
(magenta). The U5 snRNA 
(orange) is located next to 
the intertwined RNAs. 




complex) (figs. S2 and S4). The EM density is well 
defined for the intron lariat, suggesting that the 
structure probably reflects the C, P, or ILS com- 
plex, or a mixture of the three complexes. The 
EM density for 5 -exon is weak, suggesting that 
either the ILS complex was a major species in the 
final EM sample or 5 -exon was lost during pu- 
rification. These complexes share a large set of 
stable spliceosomal components, allowing deter- 
mination of the cryo-EM structure at 3.6 A res- 
olution. The different spliceosomal complexes in 
the EM sample, major or minor, will probably be 
resolved upon collection and analysis of a larger 
set of micrographs. 

Pre-mRNA splicing and protein translation, two 
central processes in eukaryotes, are each carried 
out by supramolecular protein-RNA machineries: 
spliceosome and ribosome, respectively. The 80S 
eukaryotic ribosome consists of two well-organized 
subcomplexes: the 60S large subunit and the 40S 
small subunit. In both subunits, the core protein 
components and the RNA molecules, in a mass 
ratio of approximately 1:1, form relatively stable 
RNP complexes. A flux of protein factors regulates 
the initiation, elongation, and termination of pro- 
tein synthesis. In contrast to the ribosome, the 
spliceosome has a mass ratio of at least 10:1 in 
favor of the protein components and exhibits ex- 
treme dynamism in both composition and confor- 
mation. In the ribosome, the protein components 
are mainly located on the exterior, away from the 
catalytic center for peptide bond formation. In the 
spliceosome, the protein components surround 
and support the RNA-based catalytic center. The 
ribosome exhibits a generally isometric shape. 



whereas the spliceosome has a highly asymmetric 
morphology, with numerous surface cavities of 
varying sizes and cut-through spaces. Apparently, 
through evolution, two highly divergent strategies 
have been adopted for assembling RNPs to achieve 
complex functions in gene expression. 
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STRUCTURAL BIOLOGY 

Structural basis of pre-mRNA splicing 

Jing Hang,* Ruixue Wan,* Chuangye Yan,* Yigong Shif 

Splicing of precursor messenger RNA is performed by the spliceosome. In the cryogenic 
electron microscopy structure of the yeast spliceosome, U5 small nuclear ribonucleoprotein 
acts as a central scaffold onto which U6 and U2 small nuclear RNAs (snRNAs) are 
intertwined to form a catalytic center next to Loop I of U5 snRNA. Magnesium ions are 
coordinated by conserved nucleotides in U6 snRNA. The intron lariat is held in place through 
base-pairing interactions with both U2 and U6 snRNAs, leaving the variable-length middle 
portion on the solvent-accessible surface of the catalytic center. The protein components 
of the spliceosome anchor both 5' and 3' ends of the U2 and U6 snRNAs away from the 
active site, direct the RNA sequences, and allow sufficient flexibility between the ends and 
the catalytic center. Thus, the spliceosome is in essence a protein-directed ribozyme, with 
the protein components essential for the delivery of critical RNA molecules into close 
proximity of one another at the right time for the splicing reaction. 



I n eukaiyotic cells, the coding exons in a fresh- 
ly transcribed precursor messenger RNA 
(pre-mRNA) are interdispersed by non- 
coding introns that must be removed before 
protein translation. Removal of the introns is 
carried out by the spliceosome, a dynamic ribo- 
nucleoprotein (RNP) machine that comprises 
more than 100 protein components and five 
small nuclear RNAs (snRNAs) (7). Pre-mRNA 
splicing occurs through two steps, which are 
both SN2-type transesterification reactions (2, 3). 

In the first step of pre-mRNA splicing, the 
2 -OH group of a conserved RNA adenine nucleo- 
tide in the branch point sequence (BPS) of an 
intron initiates a nucleophilic attack on the 
phosphorous atom of the guanine nucleotide 
at the 5' end of the intron, resulting in the re- 
lease of 5'-exon and formation of an intron lariat- 
3 '-exon intermediate (hereafter, intron lariat) 
(Fig. lA). In the second step, the 3'-OH group of 
the RNA nucleotide at the 3' end of the 5 '-exon 
unleashes a second nucleophilic attack on the 
phosphorous atom of the RNA guanine nucle- 
otide at the 5' end of the 3 '-exon, leading to 
joining of two exons and release of the intron 
lariat (Fig. lA) (3). Although the nature of the 
two-step reaction has been clearly defined for 
decades, how the spliceosome facilitates such 
reaction remains largely enigmatic. How are the 
reacting pieces placed into close proximity of 
one another in the correct temporal order? Given 
the varying lengths of the exons and introns, 
how does the spliceosome accommodate pre- 
mRNA and hold the 5'-exon for both steps of the 
reaction? 

The spliceosome is a metalloenzyme, and a 
number of conserved RNA nucleotides directly 
coordinate at least two magnesium ions (Mg^^) 
that collectively catalyze the two-step reaction 
(2, 4-7). The reaction mechanism of pre-mRNA 
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splicing is thought to closely resemble that of 
the group IIA or IIB self-splicing intron, each 
involving formation of an intron lariat, and differ 
from that of the group IIC intron, which splices 
by hydrolysis through a linear intron {8, 9). An 
in vitro reconstitution of U2 and U6 snRNAs re- 
vealed RNA splicing-like activity in the absence 
of the protein components, which strongly sup- 
ports the ribozyme hypothesis {10-13). But the 
protein components of the spliceosome are ob- 
viously indispensable for pre-mRNA splicing to 
proceed because, for example, defective splic- 
ing due to a mutated RNA sequence can be res- 
cued by mutations in Prp8 from Sojcchmomyces 
cerevisiae (Spp42 from Schizosaccharomyces 
pombe) (14-18). What are the functions of the pro- 
tein components of the spliceosome during the two- 
step reaction? This Research Article addresses 
these interrelated questions through structural 
analysis of the yeast spliceosome from S. pombe, 
with a focus on the RNA components. The over- 
all structure of the spliceosome and the features 
of the protein components are reported in (19). 

Organization of the RNA components 

The yeast spliceosome contains four distinct 
RNA molecules: U2 snRNA, U5 snRNA, U6 snRNA, 
and an intron lariat, which have been unambig- 
uously located in the electron microscopy (EM) 
density map (Fig. IB and figs. SI and S2). U2, U5, 
and U6 snRNAs contain 186, 120, and 99 nucleo- 
tides (nt), respectively. In the structure, a nine- 
nucleotide fragment at the 3' end of U6 snRNA 
(nt 91 to 99) and the middle and 3' end portion of 
U2 snRNA (nt 44 to 92, 146 to 152, and 178 to 186) 
are flexible, exhibit poor EM density, and remain 
to be assigned. In addition, six nucleotides at the 
5' end and nine nucleotides at the 3' end of U5 
snRNA are disordered. All other 314 nucleo- 
tides from the snRNAs have been specifically 
assigned, representing -80% of the total snRNA 
sequences. In addition, 18 nucleotides have been 
tentatively modeled for the intron lariat on the 
basis of conserved sequences for yeast (1). 

Among the four RNA molecules, U5 snRNA 
is mostly buried in the structure (Fig. IB). A large 
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Fig. 1. Overall organization of the RNA com- 
ponents in the spliceosome. (A) A schematic 
diagram of the two-step splicing reaction. The 
intron sequences are indicated with thick green 
lines. The phosphorous and oxygen atoms that 
are directly involved in bond formation are 
highlighted in red. PPT, poly-pyrimidine tract. 
(B) Overall organization of the RNA components 
in the spliceosome. The EM density is displayed 
for the entire spliceosome, with the protein compo- 
nents shown in gray and the RNA molecules 
color-coded. U2, U5, and U6 snRNAs and the 
intron lariat are colored blue, gold, green, and 
purple, respectively. The RNA coloring scheme 
is preserved throughout this manuscript. A 
close-up view is shown to highlight the catalytic 
center and the U2/U6 duplex. (C) Overall or- 
ganization of the RNA components in isolation. 
A surface view of the four pieces of RNA is 
displayed so as to indicate their overall dimen- 
sion and relative orientation. Figures 10; 2, A, 
B, D, and E; 3; 4; 5, A and C; 6, B and E; and 7, B 
and C, were prepared in PyMol (48). Figures 
IB; 5B; and 6, A, C, and D, were made by use of 
CHIMERA (49). 




proportion of the U2 and U6 snRNAs is surface- 
exposed, with a section of the U2/U6 duplex 
(known as Helix II) exposed at the tip of the 
Central Body. The 3' end portion of U2 snRNA 
is anchored to Arm I and largely exposed, where- 
as the 5' end portion is base-paired with the 
intron lariat and U6 snRNA (Fig. 1C). The 
catalytic center, as indicated by the intron lariat, 
is partially exposed to solvent and located at the 
heart of the Central Body on the side of Arm I 
(Fig. 1, B and C). Because of the endogenous na- 
ture of the yeast spliceosome, the intron is ex- 
pected to vary in length and sequence. The intron 
is placed on the surface of the catalytic center. 



allowing its middle portion of variable length 
to reach into unrestricted space. 

These RNA molecules form an extended and 
interconnected architecture (Figs. 1C and 2A). 
The longest dimension of the RNA architec- 
ture, as measured from the 3' end of U2 snRNA 
to the 5' end of U5 snRNA, exceeds 300 A (Fig. 
1C). Loop I of U5 snRNA is positioned close to 
the catalytic center and the Mg^^ ions (Fig. 2A). 
A large majority of the nucleotides in U5 snRNA 
from G7 to C89, form a continuous duplex, ex- 
cept for three short stretches: Loop I (C47 to 
A53), Loop II (A26 to A31), and Loop III (U78 to 
A83) (Fig. 2B). The U5 snRNA duplex is divided 



into Stems I, II, and III. The 5 ' end portion of 
U2 snRNA and the 3' end portion of U6 snRNA 
form an intermolecular duplex, known as Helix 
II (Fig. 2, A and C), with its tip located ~60 to 
70 A away from the nearest Mg^^ ion at the 
active site. The 3' end portion of U2 snRNA and 
the 5' end of U6 snRNA each forming intramo- 
lecular duplexes, are separated from the spliceo- 
somal active site by -130 and 90 A respectively. 

In contrast to their 5' and 3' ends, the middle 
portion of U2 or U6 snRNA pairs up with the 
intron lariat near the active site (Fig. 2, D and E). 
Specifically, the ACAGA box of U6 snRNA forms 
an intermolecular duplex with the 5 -splice site 
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Fig. 2. Structures of individual RNA components in the spliceosome. (A) An overall illustration of 
the RNA architecture. The catalytic center— which comprises the ISL of U6 snRNA, Helix I of the U2/U6 
duplex, Loop I of U5 snRNA, and the ions— is circled. The disordered RNA sequences are indicated 

with dotted lines. (B) Structure of U5 snRNA. An illustration of the U5 snRNA structure is shown at right, 
with its secondary structural elements indicated at left. The sequences in light color are invisible because 
of their intrinsic flexibility. (C) Overall base-pairing interactions among U2 snRNA, U6 snRNA, and the 
intron lariat. Base-pairing interactions in the RNA triplex between A41-G40, U68 and A47-G48-C49 are 
identified with solid lines. (D) Structure of U6 snRNA. The intron lariat is shown here to indicate its relative 
position to the active site. (E) Structure of U2 snRNA. The view in (D) and (E) is the same as that in (A). 



(5'SS) of the intron, whereas a number of nu- 
cleotides from U2 snRNA interact with the BPS 
(Fig. 2, C and E). Intriguingly, the 2'-OH group 
of the invariant adenine nucleotide in BPS is 
located ~20 to 22 A away from the reactive Mg^^ 
ions, suggesting that the catalytic Mg^^ ions may 
only be brought to the close proximity of BPS 
during the first-step reaction. 

Catalytic center and the active site 

The catalytic center of the spliceosome consists 
of Helix I of the U2/U6 snRNA duplex, the in- 
ternal stem loop (ISL) of U6 snRNA, Loop I of 



U5 snRNA, and Mg^^ ions (Fig. 3, A and B). A 
hallmark of the spliceosomal catalytic center is 
the triplex, which is formed through additional 
base-pairing interactions between the A47-G48- 
C49 triad of Helix Ib and three nucleotides, A41, 
G40, and U68 of U6 snRNA, respectively. The 
active site, featured by two Mg^^ ions and their 
coordinating phosphate groups, is located within 
the ISL of U6 snRNA and configured by the cat- 
alytic triplex (Fig. 3C). Both Mg^^ ions are coor- 
dinated by the phosphate group of U68, whereas 
the phosphate groups of A47, G48, and G66 also 
contribute to Mg^^ coordination (Fig. 3C). The 
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organization of the triplex brings the 5'SS close 
to the active site because the ACAGA box of U6 
snRNA recognizes the 5'SS and immediately 
precedes G40 to A41 of the triplex (Fig. 2C). In 
the meantime, U2 snRNA recognizes the highly 
conserved BPS, which contains the bulged branch 
adenine nucleotide, just downstream of Helix I, 
thus bringing BPS and 5'SS together and prim- 
ing the first catalytic step. 

The two Mg^^ ions, known as Ml and M2, 
play distinct roles during the two steps of splic- 
ing (2). During the first step, M2 activates the 
nucleophile— the 2'-0H of the branch nucleo- 
tide adenine from BPS— whereas Ml stabilizes 
the leaving group, the 3'-0H of the 3 '-end nu- 
cleotide in the 5'-exon. During the second step, 
the roles of Ml and M2 reciprocate, with Ml ac- 
tivating the nucleophile (3'-0H of the 3' end 
nucleotide in the 5'-exon) and M2 stabilizing the 
leaving group (3'-0H of the 3' end nucleotide in 
the intron). Ml is coordinated by both U and G 
of the UAG motif in the ISL of U6 snRNA, 
whereas M2 is only bound by the U nucleotide. 
Our structure is fully consistent with the pro- 
posed two-metal mechanism (Fig. 3C) (2). Ml, 
coordinated by the phosphate groups of U68 
and G66, is located close to Loop I of U5 snRNA, 
which is known to bind 5'-exon. This arrange- 
ment presumably places Ml in the proximity of 
the 3' end of 5'-exon. In addition to Ml and M2, 
there are at least two additional Mg^^ ions near 
the active site, which likely help stabilize the 
active site conformation by neutralizing the neg- 
ative charges from several closely spaced phos- 
phate groups (fig. S3A). 

Our structure provides direct experimental 
evidence that the spliceosome shares a gener- 
ally similar catalytic mechanism with the self- 
splicing group II intron (2). However, important 
differences remain. Splicing of pre-mRNA, in- 
volving precise removal of numerous introns 
of variable length and distinct sequences, re- 
quires the participation of many dozens of 
protein components in the spliceosome. The 
spliceosomal catalytic center is anchored in the 
catalytic cavity on the surface of Spp42 (Fig. 
3D). An inter-helical loop from the N-terminal 
domain of Spp42 follows a surface groove in 
the ISL of U6 snRNA, with the side chains of 
Arg68i^ Arg^^^, Lys^^^, and Lys^^^ inserting into 
the groove, perhaps stabilizing the conforma- 
tion of the ISL (fig. S3B). 

U5 snRNA and U5 snRNP 

As the central component of the spliceosome, 
U5 snRNP is the only invariant snRNP for both 
major and minor pre-mRNA splicing reactions 
and functions as a scaffold onto which the other 
snRNPs assemble (20). U5 snRNP in the struc- 
ture consists of U5 snRNA and the protein com- 
ponents Spp42 (Prp8 in S. cerevisiae), CwflO 
(Snull4 in S. cerevisiae), Cwfl7, and the hepta- 
meric Sm ring (Fig. 4A). The highly conserved 
Loop I and Stem I of U5 snRNA are bound pre- 
dominantly by secondary structural elements 
from the N-terminal domain of Spp42 (Fig. 4B). 
Loop II and a portion of Stem III of U5 snRNA 
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Fig. 3. The catalytic center and the active site of the spliceosome. 

(A) An overall illustration of the catalytic center of the spliceosome. 

Three perpendicular views are shown. The intron lariat, Helix II of the 
U2/U6 duplex, and Stem I of U5 snRNA are included so as to indicate the 
relative orientation. (B) A close-up view on the catalytic center. Helix I of the 
U2/U6 duplex is placed to the left of the two catalytic ions, and the ISL of 
U6 snRNA is close to Loop I of U5 snRNA. (C) A close-up view on the active 
site. Among the two catalytic Mg^^ ions, the first (Ml) is coordinated by 
phosphate groups from U68, G66, and G48 of U6 snRNA, and the second (M2) 



is bound by phosphates from U68, G48, and A47. A47 and G48 are part of the 
RNA triplex. (D) Both the catalytic center and U5 snRNA are anchored on 
Spp42.Two perpendicular views are shown. The catalytic center is placed into a 
positively charged, catalytic cavity on Spp42. Interactions between the active 
site and Loop I of U5 snRNA are mediated in a channel below the surface (left). 
This may allow retention of the 5'-exon after the first-step reaction. 
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are recognized by CwflO, whereas Loop III is 
exposed to solvent in a surface cavity of the 
spliceosome. Cwfl7 is placed in the close proximity 
of Stems II and III. The 3' end U-rich sequences 
of U5 snRNA are specifically bound by the hep- 
tameric Sm ring (Fig. 4A). 

As the largest and most conserved spliceo- 
somal protein, Spp42 is a central component of 
U5 snRNP. Spp42 anchors U5 snRNA while simul- 
taneously interacting with both U2 and U6 
snRNAs. The N-terminal domain of Spp42 is 
entangled with U5 snRNA through an exten- 
sive interface, with an extended loop (residues 
480 to 497) tracking the major groove between 
Stem I and Loop II and thus locking U5 snRNA 
into a relatively rigid conformation (Fig. 4B). 
Several positively charged amino acids from this 
loop, exemplified by Lys^^^ and Arg^^^, make 
hydrogen bonds to the phosphate groups of U5 
snRNA (Fig. 4C). The imidazole side chain of 
His^®° appears to directly recognize G66 through 
a hydrogen bond. On the back side of the major 
groove bound by the extended loop, the RNA 
duplex is contacted by at least seven Arg and Lys 
residues from two a-helices (Fig. 4D). These basic 
residues mediate a number of hydrogen bonds 
to the phosphate backbone of U5 snRNA. The 
guanidinium group of Arg^^^ reaches into the 



minor groove, donating three hydrogen bonds 
to U36 and C64 of U5 snRNA (Fig. 4E). The nu- 
cleotide sequences between Stem I and Loop I 
of U5 snRNA are located in the vicinity of an 
inter-helical loop (residues 786 to 791) from the 
N-terminal domain and an a-helix from the 
Thumb/X domain (Fig. 4F). 

In addition to Spp42, at least five other pro- 
teins make specific interactions with U5 snRNA 
(fig. S4A). The side chains of Thr^^^ and Arg^^^ 
from CwfLO donate two hydrogen bonds to the 
backbone phosphates of C28 and A30, respec- 
tively (fig. S4B), whereas four Lys residues of 
Cwfl7 directly contact the phosphates of A12, 
A13, and G14 (fig. S4C). The bases G96 and 
UlOl are each sandwiched by two aromatic and/ 
or basic residues from the Sm proteins (fig. 
S4D). Last, Cwfl/Prp5 and Cwfl5 associate with 
each other, whereas Lys^^^ of CwfL/Prp5 and 
His^^ of Cwfl5 make specific hydrogen bonds 
to 2'-OH groups of G39 and A59, respectively 
(fig. S4E). 

Placement of snRNA by protein 
components 

Our structure clearly identifies the spliceosome 
as a ribozyme, with the active site in U6 snRNA 
well defined (Fig. 3). Yet, snRNAs only account 



for a very small fraction of the molecular mass in 
the spliceosome, with the rest contributed by a 
large number of spliceosomal proteins. These pro- 
teins can be divided into two general categories: 
enzymes exemplified by the RNA-dependent 
helicases/adenosine triphosphatases (ATPases) 
and structural components that lack enzymatic 
activities (7, 21). Functions of the enzymes in 
the splicing reaction have been relatively well 
understood because alteration of an enzymatic 
activity frequently leads to a splicing defect at a 
specific stage of the reaction. For example, 
mutations in the helicase Brr2 have a negative 
impact on the unwinding of the U4/U6 duplex, 
resulting in stalled activation of the spliceo- 
some preceding the first-step reaction (22-26). 
In contrast, the structural components of the 
spliceosome are less well understood because 
their mutations may either disrupt the spliceo- 
somal assembly or remain functionally silent. 
The organization of the protein components 
relative to that of snRNAs strongly suggests that 
an important function of the structural compo- 
nents is to direct the key snRNA elements to the 
catalytic center while anchoring the ends of 
these snRNA molecules away from the catalytic 
center (Fig. 5A). This arrangement may allow 
sufficient space for the enzymatic components. 
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Fig. 4. Structure of U5 snRNP. (A) Overall structure of U5 snRNR Except 
Brr2, all other core components of U5 snRNP are clearly identified in the 
structure. Brr2 is present in the spliceosome but is highly flexible and shows 
no clear EM density. (B) Recognition of U5 snRNA is mediated predominant- 
ly by the N-terminal domain of Spp42. An extended loop is inserted into the 
major groove between Stem I and Loop II, holding U5 snRNA into registry. 
(C) A close-up view on the specific interactions between residues from the 
extended loop and the major groove of U5 snRNA. His^^° and Asn^^^ appear 



to recognize specific bases in the groove. Potential hydrogen bonds are 
represented by blue dotted lines. (D) Representative interactions between 
helical elements of Spp42 N-terminal domain and Stem I of U5 snRNA. (E) A 
close-up view on Arg^®^ of Spp42. The guanidinium group of Arg^®^ reaches 
deep into the minor groove, making three hydrogen bonds to U36 and C64. 
(F) A close-up view on Loop I of U5 snRNA. An a-helix from the Thumb/X 
domain and a helix-loop-helix motif are positioned close to Loop I for potential 
interactions. 



such as the RNP remodeling proteins, to exert 
their functions. 

The key elements that are essential for the 
splicing reaction are located in the middle por- 
tion of U6 and U2 snRNAs (Fig. 5A). Both the 5 ' 
and 3' ends of U6 or U2 snRNA are anchored 
by the structural protein components at sites 
that are distant from the catalytic center (Fig. 
5B). This arrangement allows for considerable 
flexibility and regulatory space between the cat- 
alytic center and the ends of snRNAs. For exam- 
ple, the 5 ' end of U6 snRNA is anchored by an 
N-terminal a-helix of Spp42 and the WD40 re- 
peat protein Cwfl7 (Fig. 5A and fig. S5A). The 3' 
end portion of U2 snRNA is specifically recog- 
nized by the Leal/Msll complex that is located 
-130 A away from the active site (Fig. 5C). This 
part of U2 snRNA is connected to the catalytic 
center through protein components including 
the half-a-tetratricopeptide (HAT) repeats con- 
taining proteins Cwf3/Syfl and Cwf4/Syf3. The 
5' end of U2 snRNA base-pairs with the 3' end of 
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U6 snRNA to form Helix II, with its tip posi- 
tioned -60 to 70 A away from the active site. 
Helix II is held in place by Spp42, Cwf4/Syf3, 
and other structural protein components (Fig. 
5B and fig. S5B). The intervening sequences be- 
tween the ends and the catalytic center, such as 
those between Helix II and the catalytic center, 
are also oriented by structural protein compo- 
nents (fig. S6). 

Putative location of the 5'-exon 

We sought to locate the spliced 5'-exon in the 
structure, which should be present at a location 
that is close to Loop I of U5 snRNA. However, the 
cryogenic EM (cryo-EM) sample contains a mix- 
ture of different spliceosomal complexes { 19 ), 
which is predicted to considerably weaken the 
EM density for the 5'-exon. Close examination 
of the EM density of the spliceosome reveals a 
surface channel that goes into the active site 
(Fig. 6A). Weak EM density was indeed present 
close to the nucleotides U50-U51-U52 in Loop I 



of U5 snRNA, and its appearance is consistent 
with a few stacked nucleotides (Fig. 6B). Con- 
sidering the degenerate sequences of the 5'-exon, 
the weak EM density is significant and allows 
putative modeling of four contiguous nucleo- 
tides representing the 5'-exon. Such a 5'-exon 
can be comfortably placed through the surface 
channel (Fig. 6C) into the vicinity of the active 
site, with the 3' end of the 5 '-exon within 5 A of 
the catalytic Mg^^ ions (Fig. 6D). This putative 
channel for the 5'-exon is located in the middle 
of the Central Body, surrounded by N-terminal se- 
quences from Spp42 and a helix-loop-helix mo- 
tif from Cwfl9 (Fig. 6C). Consistent with this 
analysis, this portion of the Spp42 surface is 
highly enriched by positively charged amino 
acids (Fig. 6E). 

Comparison with the group I IB 
self-splicing intron 

The catalytic center of the spliceosome closely 
resembles that of the group IIB self-splicing 
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Fig. 5. Placement of RNA molecules by the protein components. (A) 

An overall view on the placement of three snRNAs in the spliceosome. Most 
of the protein components are omitted so as to highlight the characteristic 
organization of snRNAs. The ends of snRNAs are anchored away from the 
catalytic center, and their reactive pieces are placed into the catalytic center 
through protein-directed routes. (B) A surface view of the spliceosome so as to 



highlight the positioning of U2 snRNA. The 3' end of U2 snRNA is anchored by 
Leal/Msil in Arm I, whereas its 5' end pairs up with the 3' end of U6 snRNA at 
the tip of the Central Body. The intervening portion of the U2 snRNA sequences 
are continuously bound and directed by protein components, some of which 
remain to be identified owing to their flexibility. (C) An illustration of the 
placement of U2 snRNA. 



intron (Fig. 7, A to C) (27-29). The ISL of U6 
snRNA in the spliceosome corresponds to Do- 
main V of the group IIB intron P.U.LSUI2 from 
the brown algae Pycnocoma littoralis (29), and 
the Mg^^-coordinating U68 in the spliceosome 
is replaced by the nucleotide A574 in PM.LSUI2. 
The base-pairing interactions between the 
BPS of the intron lariat and U2 snRNA in the 
spliceosome are similar to those in Domain VI 
of the self-splicing intron, with the bulged branch 
adenine nucleotide in BPS corresponding to 
A615 in P.li.LSUI2. The catalytic triad A47-G48- 
C49 in the spliceosome is replaced by A551- 
G552-C553 in the group IIB intron. Formation 
of the triplex in the spliceosome also closely 
resembles that in the self-splicing intron (Fig. 
7, B and C). The catalytically active Mg^^ ions 
are also similarly coordinated, by U68 in U6 
snRNA and A574 in P.U.LSUI2. Both the posi- 
tioning of the catalytic metal ions and the 



catalytic triad are conserved, indicating a com- 
mon evolutionary origin between these two ri- 
bozymes. In addition to the observed striking 
similarities in catalytic mechanism between 
the spliceosome and group IIB introns, the key 
protein component Spp42 (Prp8 in S. cerevisiae) 
contains an RT-like domain, whereas many 
group II introns depend on an RT for folding 
and activity (30). 

Discussion 

Splicing of pre-mRNA is an exceptionally dy- 
namic and complex process. All five snRNPs 
are assembled in the spliceosomal B complex. 
Through the actions of the ATPases Prp28 (U5- 
lOOK in human) and Brr2, U1 and U4 snRNP are 
dissociated, forming an activated spliceosome 
[the activated B complex (B^^)] (21, 23-26, 31). 
Then, the helicase Cdc28 (Prp2 in S. cerevisiae, 
hPrp2/HDX16 in human) dissociates the SF3a/b 



complex from U2 snRNP, allowing BPS to partic- 
ipate in the Step I reaction in the catalytically 
competent spliceosome (B* complex) (32, 33). 
The Step I reaction is greatly facilitated by the 
presence of Cwfl6 (Yju2 in S. cerevisiae, CCDC96 
in human) and Cwf25 (cwc25 in S. cerevisiae, 
CWC25 in human), which juxtapose 5'SS and 
BPS. The catalytic step I spliceosome (C com- 
plex) is formed after the Step I reaction (34). 
Conformational rearrangement, induced by the 
ATPase Prpl6, primes the 5'-exon and 3'SS 
for the initiation of the second step splicing 
(35-37). With the help of the splicing factors 
Slu7, Cdc40 (Prpl7 in S. cerevisiae), Prpl8, and 
the ATPase Prp22, the Step II reaction is com- 
pleted, forming the post-catalytic spliceosome 
(P complex) (38-45). We speculate that the cen- 
tral components of the spliceosomal catalytic 
center— ISL of U6 snRNA, Helix I of the U2/U6 
duplex, and Loop I of U5 snRNA— maintain a 
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Fig. 6. The 5'-exon is likely located in a surface channel close to the 
active site. (A) An overall view on the potential position of 5'-exon in the 
spliceosome. Two views of the EM density of the entire spliceosome are shown, 
with the left displayed as a semitransparent surface. The 5'-exon has been 
modeled into the structure. (B) Weak EM density is found close to Loop I of U5 
snRNA. This weak density, thought to belong to 5'-exon, allows putative model- 



ing of four contiguous RNA nucleotides. (C) The modeled 5'-exon can reach the 
active site through a surface channel. This channel is formed by Spp42 and 
Cwfl9. (D) The modeled 5'-exon is located close to the catalytic Mg^^ ions. 
(E) The surface channel is highly enriched by positively charged amino 
acids. This feature is consistent with the suggestion that the channel is likely 
used to hold the 5'-exon. 



generally similar conformation in the B*, 
C, and P complexes (fig. S7). In response to the 
different stages of the splicing reaction, the 
relative positions of 5'SS, BPS, 3'SS, and their 
associating U2 and U6 snRNA sequences may 
undergo concerted changes. In the B^^^ com- 
plex, the BPS and 5'SS are physically occluded 
from approaching each other (fig. S7), presum- 
ably by the presence of the SF3a/b complex. In 
the B* complex, the BPS and 5'SS are placed 
close to each other but can only proceed to the 
Step I reaction with the help of Cwfl6 and 
Cwf25. In the C complex, the 5'-exon is poised 
for catalysis, but the 3'-exon needs to be brought 
into close proximity through the actions of Step 
II splicing factors. Last, in the P complex the 5'- 
and 3 '-exons are ligated together and may only 
weakly associate with Loop I of U5 snRNA (fig. 
S7). Our model predicts a generally similar 
overall architecture among the four function- 
ally distinct spliceosomal complexes B’^^^ B*, C, 
and P. We further speculate that the splicing 
stage-specific factors and enzymes are recruited 
onto the generally similar spliceosomal com- 
plex to facilitate distinct stages of the splicing 
reaction. 



Our structural analysis corroborates a large 
body of experimental observations. A U2/U6 
three-way junction is formed at the core of the 
RNA network in the purified yeast spliceosome 
(Figs. 2 and 3), which was first proposed by 
Guthrie and colleagues in 1992 (46). Since then, 
extensive research effort has been directed at 
the structure and mechanism of this three-way 
junction. Involvement of the invariant triad 
AGC in the catalytic triplex has been confirmed 
with rescue experiments in which growth de- 
fects caused by triad mutation were suppressed 
by compensatory mutations in the predicted base- 
triple partners (6, 19, 46, 47). The U6 ISL, speci- 
fied by the bulged uridine nucleotide (U68 in S. 
pombe), adopts an unusual secondary structure 
and is indispensable for both steps of the splicing 
reaction (48, 49). Considering the invariant nu- 
cleotides in the catalytic center and the conserved 
catalytic mechanism, the human spliceosome 
should also contain the U2/U6 three-way junc- 
tion, rather than the proposed four-way junction 
(12). The observed structural features also sup- 
port the two-metal mechanism of pre-mRNA 
splicing and confirm the importance of the 
bulged uridine nucleotide in Mg^^ coordination 



(2, 50). The observed U2/U6 three-way junction 
in our cryo-EM structure of S. pombe spliceo- 
some differs in a major way from the solution 
structure of an isolated U2/U6 snRNA complex 
from S. cerevisiae (51). In addition, the confor- 
mation of U6 snRNA in our structure is substan- 
tially different from that in the crystal structure 
of the core U6 snRNP (fig. S8) (47). Last, our 
structural analysis also confirms the detailed 
predictions of the RNA network in the spliceo- 
some (52). 

Structural determination of the yeast spliceo- 
some at 3.6 A resolution represents a major step 
forward for mechanistic understanding of the 
spliceosomal function. Spliceosome is a protein- 
controlled ribozyme. The RNA-based catalytic 
center is assembled on the scaffold of Spp42 and 
U5 snRNP. The ends of the snRNA are anchored 
away from the catalytic center, and the inter- 
vening sequences between the snRNA ends and 
the catalytic center are directed by both struc- 
tural protein components and RNP remodeling 
enzymes. Such a design likely optimizes the 
possibilities of regulation on the catalytic center 
and the splicing reaction. The collective action 
of the numerous protein components serves to 
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Fig. 7. Comparison of the catalytic centers between the spliceosome and the self-splicing group 
MB intron. (A) Comparison of the secondary structural elements in the catalytic centers of the spliceo- 
some and the self-splicing group IIB intron P.H.LSUI2 from the brown algae P. littoralis. (B) Structure of the 
catalytic center of the spliceosome, with a close-up view on the catalytic triplex. Three bases from A41, 
G40, and U68 of U6 snRNA form non-Watson-Crick pairs with bases A47 G48, and C49, respectively. 
(C) Structure of the catalytic center of the self-splicing group IIB intron, with a close-up view on the 
catalytic triplex. A574 of the self-splicing intron corresponds to U68 in U6 snRNA, both playing a major 
role in Mg^^ coordination. Comparison between the spliceosome and the self-splicing group IIB intron 
was performed in PyMol (48). 



deliver the critical pieces— 5'SS, BPS, 3'SS, and 
5'-exon— into the close proximity of the catalytic 
ions for catalysis. 
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MESOSCOPIC PHYSICS 

Coherent manipulation of Andreev 
states in superconducting 
atomic contacts 

C. Janvier/ L. Tosi/’^ L. Bretheau/* Q. 6. Girit/f M. Stern/ P. Bertet/ P. Joyez/ 

D. Vion/ D. Esteve/ M. F. Goffinan/ H. Pothier/ C. Urbrna^ij: 

Coherent control of quantum states has been demonstrated in a variety of 
superconducting devices. In all of these devices, the variables that are manipulated are 
collective electromagnetic degrees of freedom: charge, superconducting phase, or flux. 
Here we demonstrate the coherent manipulation of a quantum system based on Andreev 
bound states, which are microscopic quasi-particle states inherent to superconducting 
weak links. Using a circuit quantum electrodynamics setup, we performed single-shot 
readout of this Andreev qubit. We determined its excited-state lifetime and coherence time 
to be in the microsecond range. Quantum jumps and parity switchings were observed in 
continuous measurements. In addition to having possible quantum information 
applications, such Andreev qubits are a test-bed for the physics of single elementary 
excitations in superconductors. 



T he ground state of a uniform superconduc- 
tor is a many-body coherent state. Micro- 
scopic excitations of this superconducting 
condensate— which can be created, for ex- 
ample, by the absorption of photons with 
high-enough energy— are delocalized and inco- 
herent because they have energies in a continu- 
um of states. Localized states arise in situations 
where the superconducting gap A or the super- 
conducting phase undergo strong spatial varia- 
tions: examples include Shiba states around 
magnetic impurities (1) and Andreev states in 
vortices (2) or in weak links between two super- 
conductors (3). Because they have discrete ener- 
gies within the gap, Andreev states are expected 
to be amenable to coherent manipulation (4-8). 
In the simplest weak link, a single conduction 
channel that is shorter than the superconducting 
coherence length th ere are only tw o Andreev 
levels ±Ea(V^) = ±Ay/i - x sin^(6/2), which are 
governed by the transmission probability t of elec- 
trons through the channel and the phase difference 
5 between the two superconducting condensates 
(3). Despite the absence of actual barriers, quasi- 
particles G)ogoliubons) occupying these Andreev 
levels are localized over a distance around the 

weak link by the gradient of the superconduct- 
ing phase, and the system can be considered an 



Andreev quantum dot (5, 6). Figure 1 shows the 
energies Eg^o,e(5) of the different states of this 
dot. In the spin-singlet ground state |g), only 
the negative-energy Andreev level is occupied 
andF^g = -E^- If a single quasi-particle is added, 
the dot reaches a spin-degenerate odd-parity 
state |o) with Eq = 0 (9-12). Adding a second 
quasi-particle of opposite spin to the dot in 
state |o) brings it to a spin-singlet even-parity 
excited state \e) with E^ = +E^ (13, 14). The \e) 
state can also be reached directly from |g) by 
absorption of a photon of energy 2E^- In this 
work, we demonstrate experimentally the ma- 
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nipulation of coherent superpositions of states 
|g) and \e), even if parasitic transitions to |o) are 
also observed. 

Atomic-size contacts are suitable systems to 
address the Andreev physics because they ac- 
commodate a small number of short conduction 
channels (i5). We create them using the micro- 
fabricated break-junction technique (16). Figure 2 
presents the sample used in the experiment. An 
aluminum loop with a narrow suspended con- 
striction (Fig. 2C) is fabricated on a polyimide 
flexible substrate mounted on a bending mech- 
anism cooled down to ~30 mK (17). The substrate 
is first bent until the bridge breaks. Subsequent 
fine-tuning of the bending allows creating dif- 
ferent atomic contacts and adjusting the trans- 
mission probability of their channels. The magnetic 
flux 4) threading the loop controls the phase drop 
5 = 27 t4)/4)o quantum) across the contact 

and, thus, also controls the Andreev transition 
frequency AC'c, 6) = 2EJh (h, Planck’s constant). 
To excite and probe the Andreev dot, the loop 
is inductively coupled to a niobium quarter- 
wavelength microwave resonator (17) (Fig. 2B) 
in a circuit quantum electrodynamics archi- 
tecture (18, 19). The resonator is probed by 
reflectometry at a frequency /o close to its bare 
resonance frequency/R ^ 10.134 GHz. The actual 
resonator frequency is different for each one of 
the three Andreev dot states: In the odd state, the 
resonance frequency is unaltered, whereas the 
two even states lead to opposite shifts around^ 
(20). The Andreev transition \g)^\e) is driven by 
a second tone of frequency/i. Details of the setup 
are shown in figs. SI and S2 (20). 

Here we present data obtained on a represent- 
ative atomic contact containing only one high- 
transmission channel. Data from other contacts 
are shown in figs. S6 to S8. First, we performed 
pulsed two-tone spectroscopy by applying a 13-ps 
driving pulse of variable frequency, immedi- 
ately followed by a 1-ps-long measurement pulse 
ifo ^ 10.1337 GHz) probing the resonator with an 

c 



®) 

o) 

g) 
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Fig. 1. Single-channel Andreev quantum dot. (A) Energy levels: Two discrete Andreev bound levels 
detach symmetrically from the upper and lower continue of states (light gray regions for |E| > A). 
Photons of energy 2 Ea can induce transitions between the two Andreev levels (magenta arrows). (B) Oc- 
cupation of Andreev levels in the four possible quantum states of the Andreev dot. Only the lower 
Andreev level is occupied in the ground state \g) (blue box). In the excited state |e> (red box), only the 
upper Andreev bound level is occupied. In the doubly degenerate odd state |o), both Andreev levels are 
either occupied or empty. (C) Energy of the four Andreev dot states for a channel of transmission 
probability t = 0.98, as a function of the phase difference 6 across the weak link. 
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amplitude corresponding to an average number of 
photons in the resonator n ^ 30 (Fig. 3A). Apart 
from the signal at/i ^ fo, the spectrum displays a 
resonance corresponding to the Andreev tran- 
sition. The spectrum is periodic in flux, with pe- 
riod (|)o, which allowed us to calibrate the value of 
5 across the contact (fig. S3). Fits of the measured 
lines for different contacts with the analytical 
form of A(t^,S) provide the transmission prob- 
ability T of highly transmitting channels with up 
to five significant digits, as well as the supercon- 
ducting gap A/h = 44.3 GHz of the aluminum 
electrodes. 

The coupling between the resonator and the 
Andreev dot is evident from the avoided crossing 
between the two modes observed in single-tone 
continuous-wave spectroscopy (Fig. 3B). Fitting 
the data with the predictions of a Jaynes- 



Cummings model (19, 20) yields gl2n = 74 MHz 
(g, coupling strength) at the two degeneracy 
points where A = /r. Notably, the resonance of 
the bare resonator is also visible for all values of 
the phase, signaling that on the time scale of the 
measurement the Andreev dot is frequently in 
the odd state |o) (10, 12, 21). 

Figure 3C shows the density plots of the re- 
flected signal quadratures (I and Q) correspond- 
ing to sequences of 8000 measurement pulses 
taken at 5 = tt, without (Fig. 3C, left panel) and 
with (right panel) a tt driving pulse applied just 
before each measurement pulse. The results 
gather in three separate clouds of points, dem- 
onstrating that a single measurement pulse al- 
lows resolution of the dot state. The normalized 
number of points in each cloud is a direct mea- 
surement of the populations of the three states. 



The two panels of Fig. 3C show the population 
transfer between the two even states induced by 
the driving pulse. Continuous measurement of 
the state of the Andreev dot in the absence of 
drive reveals the quantum jumps (22) between 
the two even states and the changes of parity that 
correspond to the trapping and untrapping of 
quasi-particles in the dot (Fig. 3D). The analy- 
sis (23) of this real-time trace yields a parity- 
switching rate of ~50 kHz (20). 

The coherent manipulation at 5 = tt of the two- 
level system formed by \g) and \e) is illustrated in 
Fig. 4. Figure 4A shows the Rabi oscillations 
between |g) and \e) obtained by varying the du- 
ration of a driving pulse at frequency^ = fji'z,n) 
(movie SI). Figure 4B shows how the populations 
of |g) and \e) change when the driving-pulse fre- 
quency /i is swept across the Andreev frequency 




Fig. 2. Measurement setup of a superconduct- 
ing atomic contact in a microwave resonator. 

(A) Simplified two-tone microwave setup. The mea- 
surement (frequency fo) and drive (frequency fi) 
signals are coupled to the resonator through the 
same port. After amplification, the reflected sig- 
nal at fo is homodyne-detected by an IQ mixer, 
and its two quadratures (/ and Q) are digitized. 

(B) Optical micrograph of the quarter-wavelength 
niobium coplanar meander resonator with an inter- 
digitated capacitor C ^ 3 fF at the coupling port. 
At the shorted end, an aluminum loop is induc- 
tively coupled to the resonator field. The resonator 
has resonance frequency ff> ^ 10.134 GHz, with a 
total quality factor Q - 2200, close to critical cou- 
pling (see fig. S4). (C) Detailed view of the aluminum 
loop. Upon bending the substrate, the loop breaks at 
the narrow constriction to create an atomic contact. 
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Fig. 3. Spectroscopy and quantum jumps. (A) Pulsed two-tone spectroscopy: color-coded amplitude 
A of one quadrature of reflected signal as a function of 5 and fi. The dashed black line shows the 
theoretical fit of Andreev transition frequency f^ = 2Ef^/h with t = 0.99217. A parasitic line, corresponding 
to a two-photon process [2fi = f r -t- f^(x, 6)], is visible just below 10 GHz. (B) Single-tone continuous-wave 
spectroscopy using a vector network analyzer (average number of photons in the resonator n ^ 0.1): 
resonator reflection amplitude |f?| as a function of 5 and fo- Red dashed curves correspond to fits of the 
anticrossings (20). Data aligned with the black dashed line correspond to the Andreev dot in state |o). 
(C) Density plots of / and Q quadratures at 6 = ti illustrate single-shot resolution of the quantum state of 
the dot. (Left) No drive at h. (Right) n pulse swapping the populations of |g) and |e>. (D) Gontinuous 
measurement at 5 = ti, with n ^ 100 and no driving signal. The brown (or cyan) time trace corresponds 
to the / (or Q) quadrature. The color (blue, green, or red) of the horizontal bar represents an estimate of 
the state (g, o, or e, respectively) found using a hidden Markov model toolbox (23). 
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f^'z,n). After a n pulse, the populations relax ex- 
ponentially back to equilibrium with a relaxation 
time ri(5 = 7 t) ^4 ps (Fig. 4D). The Gaussian 
decay of detuned Ramsey fringes (Fig. 4F) pro- 
vides a measurement of the coherence time 
T* (5 = 7t) ^ 38 ns. This short coherence time is 
mainly due to low-frequency (i.e., lower-than- 
megahertz) fluctuations of the Andreev energy 
Ea(t,8), as shown by the much longer decay time 
T 2 (5 = 7t) ^ 565 ns » of a Hahn echo (Fig. 
4G). Measurements at 5 = tt on other contacts 
with the same sample, with transmissions corre- 
sponding to a minimal Andreev frequency 3 GHz 
< < 8 GHz, give mostly around 4 ps (up 

to 8.5 ps), 7^2 around 40 ns (up to 180 ns), and T 2 
around 1 ps (up to 1.8 ps), but no clear de- 
pendence of the characteristic times on x is ob- 
served (figs. S7 and S8). 

Figure 4E shows the measured relaxation rate 
Fi = l/7\ as a function of the phase 5. The ex- 
pected Purcell relaxation rate arising from the 
dissipative impedance seen by the atomic con- 
tact Gight blue line in Fig. 4E) matches the 
experimental results only close to the degeneracy 
points where A = /r (vertical dotted lines) but 
is about five times smaller at 5 = tt. On the basis 
of existing models, we estimate that relaxation 
rates due to quasi-particles (24-28) and phonons 
(7, 8, 21) are negligible. Empirically, we fit the 
data at 5 = 7T by considering an additional phase- 
independent relaxation mechanism, which re- 
mains to be identified. 

The linewidth of the spectroscopy line, which 
is a measure of the decoherence rate, shows a 
minimum at 5 = tt (Fig. 4C). The Gaussian decay 
of the Ramsey oscillations points to l//transmis- 
sion fluctuations as the main source of decoher- 
ence at 5 = 7T, where the system is insensitive to 
flux noise to first order (28). Fluctuations of x can 
arise from vibrations in the mechanical setup 
and from motion of atoms close to the contact. 
Figure 4C also shows the linewidths calculated, 
assuming i/f transmission noise and both white 
and 1/ffhix noise (20). The amplitude of the 1/f 
transmission noise, 2.5 x 10“® Hz“^/^ at 1 Hz, 
was adjusted to fit the measurement at 5 = tt. The 
amplitudes of the white and l//fiux noise were 
then obtained from a best fit of the linewidth 
phase dependence. The extracted 1/f noise am- 
plitude (5 p4>o Hz“^/^ at 1 Hz) is a typical value 
for superconducting devices and has a negligible 
effect to second order (29). The source of the ap- 
parent white flux noise (48 n(^o Hz“^/^) has not 
yet been identified. 

The Andreev quantum dot has been proposed 
as a new kind of superconducting qubit (5, 6), 
which differs markedly from existing ones (30). 
In qubits based on charge, flux, or phase (30), the 
states encoding quantum information correspond 
to collective electromagnetic modes, whereas in 
Andreev qubits they correspond to microscopic 
degrees of freedom of the superconducting con- 
densate. Our results are a proof of concept of this 
new type of qubit. Further work is needed to 
fully understand the sources of decoherence and 
to couple several qubits in multichannel contacts 
(5, 8). With its parity sensitivity, the Andreev 
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Fig. 4. Coherent manipulation of Andreev quantum dot states at 8 = tt. Colored dots show 
measured populations: ground (blue), excited (red), and odd (green) states. Lines indicate theo- 
retical fits. Sketches of pulse sequences for each type of measurement are shown in each panel 
(magenta, drive: black, measurement). (A) Rabi oscillations: populations as a function of the driving- 
pulse duration. (B) Spectroscopy: populations as a function of frequency fi of the saturating drive 
pulse. FWHM, full width at half maximum. (C) Phase dependence of the linewidth (FWHM) of the 
spectral line. Circles correspond to the result of a Lorentzian fit of the experimental resonances (20). 
The brown curve is the best fit to the data, including the contributions of 1/f transmission noise (light 
blue line) and both 1/f (orange line) and white flux noise (orange dashed line). (D) Relaxation of 
populations after a n driving pulse. (E) Phase dependence of relaxation rate Fi = 1/Ti. Circles repre- 
sent experimental data. The orange curve is the sum of the expected Purcell rate (light blue line) and 
an empirical phase-independent rate (180 kHz). In (C) and (E), vertical dotted lines indicate degener- 
acy points fA = fp- Error bars on circles denote uncertainties of the fits. (F) Ramsey fringes: popula- 
tions as a function of delay between the two n/2 pulses detuned at fi = fA(x,?i) -r 51 MHz. (G) Hahn echo: 
populations as a function of delay between the two k/2 pulses with a n pulse in between. 
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quantum dot is also a powerful tool to investigate 
quasi-particle-related limitations on the perfor- 
mance of superconducting qubits (28, 31, 32) and 
detectors (33). Furthermore, our experimental 
strategy could be used to explore hybrid super- 
conducting devices in the regime where Andreev 
states evolve into Majorana states (34-36). 
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CRITICAL PHENOMENA 

Critical behavior at a dynamic vortex 
insulator-to-metal transition 



Nicola Poccia/’^ Tatyana I. Baturina, Francesco Coneri,^ Cor G. Molenaar,^ 

X. Renshaw Wang,^ Ginestra Bianconi,® Alexander Brinkman,^ Hans Hilgenkamp,^ 
Alexander A. Golubov, Valerii M. Vinokur^* 



An array of superconducting islands placed on a normal metal film offers a tunable 
realization of nanopatterned superconductivity. This system enables investigation of the 
nature of competing vortex states and phase transitions between them. A square 
array creates the eggcrate potential in which magnetic field-induced vortices are frozen 
into a vortex insulator. We observed a vortex insulator-vortex metal transition driven 
by the applied electric current and determined critical exponents that coincided with those 
for thermodynamic liquid-gas transition. Our findings offer a comprehensive description 
of dynamic critical behavior and establish a deep connection between equilibrium and 
nonequilibrium phase transitions. 



C ritical behaviors near phase transitions can 
be classified into universality classes deter- 
mined only by a few properties characteriz- 
ing the system, such as space dimensionality, 
range of interaction, and symmetry (1, 2). A 
paradigmatic concept of universality brought deep 
understanding of equilibrium critical phenomena 
[see, e.g., (3) and references therein]. Phase tran- 
sitions and criticality far from equilibrium are less 
well understood. The experimental evidence for 
universality of nonequilibrium phase transitions 
is still scarce, calling for intensified experimental 
efforts. 

Superconducting vortices offer a unique tun- 
able laboratory for studying classical critical 
dynamics. To that end, we prepared an array 
of superconducting islands where vortices are 
pinned between the islands in the areas of weaker 
proximity-induced superconductivity— that is, at 
the energy dimples of an eggcrate potential (4). If 
thermal fluctuations are not strong enough to 
overcome the combined localizing action of mutual 
repulsion and pinning, vortices form the so-called 
vortex Mott insulating state at commensurate 
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fields corresponding to an integer number of 
vortices per pinning site (5). The predicted vortex 
Mott state seen in experiments on antidot arrays 
in superconducting films (6, 7) was conclusively 
confirmed in (8). In our experiment, performed 
in a classical regime, varying the magnetic field 
provides precise control over the vortex density and 
tunes the ratio of the vortex repulsion to the mobili- 
ty, enabling the observation of a vortex insulator- 
to-metal transition. 

Each of our samples consists of a 40-nm Au 
layer, patterned as a four-point setup in a van 
der Pauw configuration for transport measure- 
ments, on a Si/Si02 substrate (9). The Au pattern 
is overlaid with a square array of superconduct- 
ing niobium (Nb) islands 45 nm thick. An array 
contains 90,000 Nb islands placed with a period 
a = 267 nm. The diameter of an island is 220 + 
3 nm and the island separation is 47 ± 3 nm. 
Shown in Fig. 1, A to D, are scanning electron mi- 
croscopy (SEM), atomic force microscopy (AFM), 
and optical images of a sample and the height 
profile along one of the principal axes of the ar- 
ray. The superconducting transition temperature 
of the array, determined as the midpoint of the 
temperature resistance curve in the upper inset 
in Fig. lA, is Tc = 7.3 K, which is 2 K lower than 
that of bulk Nb (T^q = 9.3 K). This implies that 
the array is a strongly coupled network of super- 
conducting islands (10-12). The parameters of our 
array ensure that the intersite barriers are high 
enough to provide pinning sufficient for forma- 
tion of the vortex Mott insulator state and that 
vortex motion is thermally activated. 

The measurements are carried out in a shielded 
cryostat at temperature T = 1.4 K Figure 2A shows 
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color plots of the differential resistance, dV/dl, as 
a function of the applied current and the magnetic 
field in units of frustration parameter /= B/Bq, 



where Bq = Oq jo? = 28.6 mT, and Oq = is 
the magnetic flux quantum. Figure 2B displays 
representative isocurrent cuts as dV/dl versus B. 



Fig. 1. Experimental 
setup. (A) SEM image 
of the square array of 
Nb islands on Au. The 
inset at lower left is a 
magnified image of the 
array. The inset at 
upper right shows the 
temperature 
dependence of the 
resistance near the 
superconducting 
transition of bulk Nb 
(marked by the verti- 
cal bar). (B) 1.4 |im x 
1.4 |im AFM topogra- 
phy of the super- 
conductor islands on 
the metallic template 
of the same sample. 
(C) Optical image of 
the Au template with 
four contacts (yellow), 
Nb bus bars (light 
green), and the array 
of 300 X 300 Nb 
islands barely visible 
as a red square 
shadow on a Si/Si 02 
substrate (dark 
green). (D) Height 
profile along the prin- 
cipal axis of the array. 
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At modest currents, the data reveal a wealth of 
dips in dV/dl at integer frustrations, namely at 
/= 1, 2, and 3 (corresponding to integer numbers 
of flux quanta per elemental square of an array), 
as well as a fine structure of fractional dips at 
/= Vs, V 2 , %, Vs, ••• • These minima reflect 
the modulation of the ground-state energy 
due to formation of periodic vortex patterns in a 
magnetic field (Fig. 2, C and D). The energy spec- 
trum of the fractional vortex Mott insulator shown 
in Fig. 2C is the Hofstadter butterfly spectrum of 
the Harper equation (73). The dips in the resist- 
ance and singularities in magnetization at com- 
mensurate frustrations were observed in numerous 
experiments [see (74) and references therein]. 

Our key observation is a reversal of the mini- 
ma in dV/dl into maxima at rational /upon in- 
creasing the current bias. Scaling analysis (see 
below) shows that this is a direct manifestation 
of the existence of the vortex Mott insulator and 
its transition into a metallic state. Reversals at 
commensurate fields/; = Vs, V 2 , Vs, T ••• are most 
pronounced at integer and half-integer frus- 
tration factors (Fig. 2B). The insulating behavior 
appears as a tendency to downward divergence 
of the dV/dl traces on approach to/. Upon the 
vortex Mott insulator-to-metal transition with 
the increasing current, an upward divergence of 
dV/dl traces marks the metallic-like dV/dl be- 
havior near/. Figure 3 shows the plots of dV/dl 
versus B in the vicinity of / = V 2 , 1, 2. Similar 
minima-to-maxima flips were observed in regular 
superconducting systems of different geometries 
(75, 16). However, the mechanism of current- 
stimulated depinning proposed in (75, 16) would 
lead to disappearance of the dips in the resistance 
R - V/I responsible for the dissipation, and 
not only in dV/dl. This is not the case in our 
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Fig. 2. Vortex Mott insulator-to-metal transition. (A) Color plots of dV/dl 
versus current and magnetic field. The color scale indicates dV/dl values in 
ohms. (B) Representative dV/dl versus f = B/Bq curves at different bias 
currents. At low current bias (blue, orange, and yellow), dV/dl minima at 
f = Vs- V2. Vs- 1- Vs- V2. Vs- 2, Vs- V2- Vs- and 3 indicate formation of a 
vortex Mott insulator. Increasing current reverses minima into maxima (red 



and violet). Reversals manifesting the insulator-to-metal transition are 
highlighted by vertical arrows. (C) Ground-state energy Eg versus f obtained 
from the Harper equation (14, 23). (D) Vortex configurations at rational 
frustrations f = 0, Vs- V2- Vs- and 1. Periodic arrays of vortices (yellow 
circles) are superimposed on the SEM subimages of the square array of 
Nb islands. 
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experiment (see fig. SI). The resistance exhibits 
pronounced dips indicating strong pinning at 
rational/even at the currents where dV/dl shows 
profound maxima. Hence, the dip-to-maxima flips 
cannot be explained as vortex depinning. 

Shown in the upper row of Fig. 3 are detailed 
plots of dV/dl versus B in the vicinity of fc = \ 

2. There are separatrices, Iq, highlighted by dots 
in Fig. 3, A to C, dividing insulating-like and 
metallic-like curves. We identify as critical 
currents at which the dynamic vortex Mott tran- 
sition occurs, and find them from the condition 
d{dVldI)/df\f=f± = 0. For I > the dV/dl 
traces display minima, marked by strokes in 
Fig. 3, A to C. That Iq and Iq are different im- 
plies that in the current intervals /g (min{/Q , /q } ^ 
max{7Q,7Q}) at commensurate values f, vortex 
Mott insulating and metallic phases coexist (17). 

Now we turn to the proximity of the vortex 
insulator-to-metal transition critical point, (7o,/c). 
We conjecture mapping of the variables describ- 
ing the critical behavior of the static electronic 
Mott transition to those of the dynamic vortex Mott 
transition (9): \ U-U^\^h= |/c -/|, \T- T^\ 

|7 - 7ol, and the critical point (Tc, f/c) ^ (Io,fc), 



yielding critical scaling |7 - 7ol |&|^. To dem- 
onstrate the critical behavior, we scale dV/dl 
data into the universal form 

where e is an adjustable parameter. The best fit 
of the data near /c = ^2 is achieved at e = ^2 
(Fig. 3D). The same procedure for the left and 
right sides of/c = 1 and 2 gives rise to e = 7s 
(Fig. 3, E and F). Double-logarithmic plots in 
the vicinity of the above values of/c display a 
power-law functional form of F+(o}) ^ as 
shown in the lower panels of Fig. 3, D to F. For 
fc = V 2 , we find p = 1 + 0.03. For both/c = 1 and 
fc = 2, we find p = 1.2 + 0.03 for all four plots. This 
universal scaling behavior of dV(B, T)/dI exper- 
imentally establishes the existence of the vortex 
Mott insulator and the dynamic vortex insulator- 
to-metal transition in superconducting networks. 

To gain insight into the observed scaling behav- 
iors, we focus on/ = 1. At 7 << 7 q, thermally ac- 
tivated dynamics of vortices is governed by motion 
of large vortex bundles and occurs via a creep 



mechanism without breaking the vortex lattice 
integrity, resulting in V ^ exp(-7o/7) (18). General 
creep considerations (19) suggest that at some 
threshold current less than the depinning current, 
the elastic continuity of the vortex system breaks 
down and vortex dynamics occurs via the plastic 
gliding of dislocations. The latter is equivalent to 
phase slips of the superconducting order param- 
eter resulting in the linear ohmic response (i.e., 
the metallic behavior). Identifying this plastic 
threshold current with 7 q leads to the notion that 
the vortex metal is a phase whose dynamics is 
controlled by phase slips. Very near the thresh- 
old, 7 ^ 7o, thermally activated dynamics are 
governed by activation over a small parabolic 
barrier whose height scales as [1- (7/7o)]^^^, 
giving rise to 

Voc 1 (2) 

1 + exp { const [1 - (7/7o)]^^^} 

(20). Thus, the characteristic energy control- 
ling the response of the vortex system near the 
vortex insulator-to-metal transition scales as 
1 7 - 7ol^^^, which explains the origin of the ex- 
perimentally observed scaling |5| oc |7 - 7ol^^^ 



A 



B 



c 




1 — r™ 



tttT 1 irriTfri 

10 ^ 10 ^ 











Fig. 3. Scaling at the dynamic vortex Mott insulator-to-metal transition. (A 

to C) dV/dl versus f in the vicinity of fc = V 2 , I and 2, respectively. Each panel 
contains 36 traces in 0.01-mA steps. Colors reflect magnitudes of the current. The 
dots mark (dV/dl)(l = Iq) values separating insulating-like and metallic-like 
behaviors. The separatrix traces are chosen so that dV/dl values start to turn 
upward on approach to fc at / > Iq, dV/dl maintains a downward trend near fc 
at / < Iq, and d(dV/dl) /df\f = 0 at / = /q. In the metallic regions, dV/dl minima 
marked with strokes enclose the critical regions around fc.The traces are not sym- 
metric with respect to f^ so Iq and /J are different: (A) Iq = 0.83 mA, /J = 0.78 mA; 



(B) Iq = 1.10 mA, /J = 1.14 mA; (C) Iq = 0.45 mA, /J = 0.48 mA. (D to F) Scaling of 
the same data after subtracting the corresponding separatrix, dV/dl - (dV/dl)(l = Iq) 
with respect to variable |/ - with b = f- fc and 6 = 2 for fc = V 2 (D) and 

e = 7a for fc = l and fc = 2 [(E) and (F)]. Subpanels at left and right show scaling 
plots in the fc - |b| and fc -i- |b| vicinities of the critical frustration parameters, 
respectively: the colors refer to different |b| values. Upper panels use the linear scales 
of dV/dl - (dV/dl)(l = Iq)\ lower panels use logarithmic scales. The scales are the 
same for each fc pairwise for the left and right subpanels. The black lines in the lower 
panels correspond to powers = 1 for fc = V 2 - and to[i = 1.2 ± 0.03 for fc = l and 2. 
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for integer frustrations. Notably, the observed 
value of 8 = for the integer dynamic transi- 
tion coincides with the corresponding mean-field 
value for the temperature- and pressure-driven 
thermodynamic electronic Mott transition (21) 
belonging in the class of the liquid-gas transi- 
tion of classical systems (1, 21, 22). The universal 
scaling properties of the current- and magnetic 
field-dependent dynamical resistivity experi- 
mentally demonstrate that a vortex Mott insulator 
undergoes a phase transition resembling a liquid- 
to-gas transition at the nonequilibrium critical 
end point. The critical exponent at^ = 72 is 
8 = 72 , indicating that fractional vortex Mott tran- 
sitions belong in different universality classes. 
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QUANTUM MECHANICS 

A self-inteifering clock as a “which 
path” witness 

Yair Margalit, Zhifan Zhou, Shimon Machluf,^ Daniel Rohrlich, 

Yonathan Japha, Ron Folmanf 

In Einstein’s general theory of relativity, time depends locally on gravity; in standard 
quantum theory, time is global— all clocks “tick” uniformly. We demonstrate a new tool 
for investigating time in the overlap of these two theories: a self-interfering clock, 
comprising two atomic spin states. We prepare the clock in a spatial superposition of 
quantum wave packets, which evolve coherently along two paths into a stable interference 
pattern. If we make the clock wave packets “tick” at different rates, to simulate a 
gravitational time lag, the clock time along each path yields “which path” information, 
degrading the pattern’s visibility. In contrast, in standard interferometry, time cannot yield 
“which path” information. This proof-of-principle experiment may have implications for the 
study of time and general relativity and their impact on fundamental effects such as 
decoherence and the emergence of a classical world. 



T wo-slit interferometry of quanta, such as 
photons and electrons, figured prominently 
in the Bohr-Einstein debates on the consist- 
ency of quantum theory (7, 2). A fimdamental 
principle emerging from those debates— 
intimately related to the uncertainty principle— 
is that “which path” information about the quanta 
passing through slits blocks their interference. At 
the climax of the debates, Einstein claimed that a 
clock, emitting a photon at a precise time while 
being weighed on a spring scale to measure the 
change in its mass-energy, could evade the un- 
certainty principle. Yet Bohr showed that the 
clock’s gravitational redshift induces enough un- 
certainty in the emission time to satisfy the un- 
certainty principle. Inspired by the subtle role that 
time may play, we have now sent a clock through 
a spatial interferometer. Our proof-of-principle 
experiment introduces clock interferometry as 
a new tool for studying the interplay of general 
relativity (3) and quantum mechanics (4). 

Time in standard quantum mechanics is a global 
parameter, which cannot differ between paths. 
Hence, in standard interferometry (5) a differ- 
ence in height between two paths merely affects 
their relative phase, shifting their interference 
pattern without degrading its visibility. In con- 
trast, general relativity predicts that a clock must 
“tick” slower along the lower path; thus if the 
paths of a clock through an interferometer have 
different heights, a time differential between the 
paths will yield “which path” information and 
degrade the visibility of the interference pattern 
(6). Consequently, whereas standard interferom- 
etry may probe general relativity (7-9), clock in- 
terferometry probes the interplay of general 
relativity and quantum mechanics. For example, 
loss of visibility because of a proper time lag would 
be evidence that gravitational effects contribute 
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to decoherence and the emergence of a classical 
world (10). 

Although our interferometer is of a new type, 
it is worthwhile noting decades of progress in 
matter-wave interferometry (IT). Specifically, we 
note experiments in which neutron spins have 
been rotated in a magnetic field (12, 13) and ex- 
periments in which different paths were labeled 
(14). For completeness, we also note recent work 
on the so-called Compton clock interferometer 
(75) and the debates that ensued [(75, 77) and 
references therein]. 

In our experiment, atomic clocks— atoms in 
superpositions of internal states— pass through 
an atomic matter-wave interferometer. We dem- 
onstrate that the visibility of interference pat- 
terns produced by thousands of self-interfering 
clocks [atoms in a Bose-Einstein condensate (BEC)] 
depends on the (simulated) proper time differ- 
ential between the recombined wave packets of 
each clock. We simulated the time differential or 
lag by artificially making one clock wave packet 
“tick” faster than the other. Although our clock is 
not accurate enough to be sensitive to special- or 
general-relativistic effects, it is able to demon- 
strate that a differential time reading affects the 
visibility of a clock self-interference pattern (6); 
specifically, the visibility equals the scalar pro- 
duct of the interfering clock states. 

In principle, any system evolving with a well- 
defined period can be a clock. We used a quantum 
two-level system: a ®^Rb atom in a superposition 
of two Zeeman sublevels, the rrip = 1 and mp = 2 
sublevels of the F =2 hyperfine state. 

The general scheme of the clock interferome- 
ter is shown in Fig. 1 (78). To prepare the clock in 
a superposition of two different locations, we 
made use of the previously demonstrated Stern- 
Gerlach type matter-wave interferometer on an 
atom chip (19), creating a coherent spatial super- 
position of a ^^Rb BEC (~10^ atoms) 90 pm below 
the chip surface. Initially, after the application 
of a field gradient beam splitter (FOBS) and a 
stopping pulse that zeroes the relative velocity of 
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the two atomic wave packets, the wave packets 
are in the same internal atomic state = 

1 2, 2) = 1 2)) as well as in the same external mo- 
mentum state. A radio-frequency (RF) tt/2 pulse 



(Rabi frequency Ur and duration Tr) tuned to 
the transition from |2) to |1) = |2,1) forms the 
clock by transferring the atoms from the 1 2) state 
to the internal superposition state (|1) + |2))/\/2. 



A Coherent Clock Relative clock 
splitter initialization rotation 



Imaging 



B 



Atom chip 




Fig. 1. Experimental sequence of the clock interferometer. (A) Detailed sequence (not to scale). After 
a coherent spatial splitting by the FOBS and a stopping pulse, the system consists of two wave packets in 
the |2) state (separated along the z axis, the direction of gravity) with zero relative velocity (J9).The clock is 
then initialized with a RF pulse of length 7 r, after which the relative “tick” rate of the two clock wave packets 
may be changed by applying a magnetic field gradient dB/dz of duration Tq. Last, before an image is taken (in 
thexz plane) the wave packets are allowed to expand and overlap. (B) The same sequence as in (A) presented 
in space (in the yz plane). The chip wires are parallel tox, and the imaging beam is parallel toy. (C) Evolution in 
time, synchronized with (A). Each clock wave packet shows as a one-handed clock, in which the hand 
corresponds to a vector in the equatorial plane of the Bloch sphere. When the clock reading (the position of the 
clock hand) in the two clock wave packets is the same, fringe visibility is high. (D) When the clock reading is 
opposite (orthogonal), it becomes a “which path” witness, and there is no interference pattern. 



The pulse is applied under a strong homoge- 
neous magnetic field in order to push the tran- 
sition to 1 2, 0) out of resonance via the nonlinear 
Zeeman effect, thus forming a pure two-level 
system (18). 

In order to examine the coherence of the 
clock spatial superposition, we let the two clock 
wave packets freely expand and overlap to create 
spatial interference fringes (Fig. 2A). Because two 
BEC wave packets are always expected to yield 
fringes when they overlap, many experimental 
cycles are required in order to prove phase sta- 
bility or, in other words, coherent splitting of the 
clock. In the averaged picture of 100 single shots 
taken continuously over a period of ~2 hours 
(Fig. 2B), the contrast falls by a mere 2% relative 
to the mean of the single-shot visibility, demon- 
strating a stable phase. The narrow phase distri- 
bution in the data (18) reveals that the clock 
splitting is coherent (meaning the clock was in 
a superposition of two locations). 

We now show that clock time is indeed a 
“which path” witness. For a single-intemal-state 
interferometer, a phase difference will not change 
the visibility of the fringes. In contrast, the relative 
rotation between the two clock wave packets is 
expected to influence the interferometric visi- 
bility. In the extreme case, when the two clock 
states are orthogonal— for example, one in the 
state (|1) + |2))/\/2 and the other in the state 
(|1) - |2))/v^— the visibility of the clock self- 
interference should drop to zero. We applied a 
magnetic gradient pulse [inducing a “tick” rate 
difference Aco (iS)] of duration Tq so as to in- 
duce a relative angle of rotation between the two 
clock wave packets (Fig. 1). When the relative 
rotation angle is n, we observed in a single shot 
that the visibility of the interference pattern drops 
to zero (Fig. 2C). Again, this is unlike standard 
interferometers in which a phase difference does 
not suppress visibility, which this is contrary to 
standard split-BEC interference experiments in 
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Fig. 2. Clock interfer- 
ence. (A) A single 
experimental shot of a 
clock interfering with 
itself (z axis values are 
relative to the chip sur- 
face). As Tq = 0, the 
clock rate is approxi- 
mately the same in the 
two wave packets, and 
interference is visible. As ^ 
can be seen from Fig. 3, ^ 

a constant differential 
rotation of the clocks, 
exists even for Tq = 0 
[because of a residual 
magnetic gradient in our 
chamber (18)]. This 
somewhat reduces the 

visibility. (B) To prove the coherence of the clock spatial splitting, an average of 100 consecutive shots such as that in (A) is presented, with only a 2% drop in 
visibility (18). (C) To prove that clock time acts as a “which path” witness, we present a single shot in which the differential rotation angle 4>o + AcoTq equals 
71. (D) Similar to (B), but where indistinguishability is restored by (^o + AcoTq = 2ti (visibility is 47 ± 1%, down from a single-shot average of 51 ± 2%). The fits are 
a simple combination of a sine with a Gaussian envelope (19). The vertical position as well as the vertical extent of the clouds is explained in (18). Throughout 
this work, all data samples are from consecutive measurements without any post-selection or post-correction. 
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which a single shot always exhibits substantial 
visibility. A revival of the single-shot visibility is 
seen when the differential rotation angle is taken 
to be 2n (where we again present an average of 
100 shots to confirm coherence) (Fig. 2D). 

To obtain a more general view of the effect, we 
studied the dependence of the interference pat- 
tern visibility on the differential rotation angle 
between the two clock wave packets over the 
range 0 to 47 t, by varying Tq to alternate be- 
tween clock indistinguishability and orthogonality 
(Fig. 3). The blue data in Fig. 3 present the clock 
interference pattern visibility, clearly showing os- 



cillations [consistent with the expected period 
(iS)]. Comparing the latter oscillations with the 
visibility of a single-intemal-state “no clock” inter- 
ference (UrTr = 0) (Fig. 3, red data) confirms that 
the oscillations are due to the existence of a clock. 
The single-internal-state interference data also 
confirm that the overall drop in visibility is not 
due to the formation of the clock. This upper 
bound is due to the magnetic gradient pulse 
causing imperfect overlap between the two wave 
packets (18). A lower bound on the visibility is 
due to the spatial separation of the 1 1) and 1 2) wave 
packets (gradual breakup of the clock), again due 



to the magnetic gradient (18), which results in an 
increase of the visibility as expected from two 
independent single-state interferometers. 

The essence of the clock is that it consists of 
a superposition of two levels, |1) and |2). In Fig. 3, 
we chose to work with an equal population of 
the |1) and |2) states upon clock initialization 
to create a proper clock, thus maximizing the vi- 
sibility’s dependence on the differential rotation. 
To further prove that it is the clock reading that 
is responsible for the observed oscillations of 
visibility, we modulated the very formation of 
the clock by varying the clock-initiating RF 
pulse (Tr) so that the system preparation alter- 
nates between a proper clock and no clock at all 
(Fig. 4). Specifically, varying Tr changes the pop- 
ulation ratio of the two components of the clock. 
When the differential rotation of the two clock 
wave packets (AcoTg) is set to n (orthogonal 
clocks), as shovm by the blue data in Fig. 4A, the 
interferometer visibility oscillates as a function 
of the ratio of the clock states’ initial population. 
This is so because when UrTr equals multiples of 
7T, only one of the clock states is populated, and 
the system is actually not a clock. In this case, we 
have a standard interferometer; “clock orthog- 
onality” and clock time as a “which path” wit- 
ness do not exist irrespective of the fact that 
AcoTg = n, and consequently, full visibility is ob- 
tained. When a proper clock is formed (equal ini- 
tial populations), clock time is an effective witness, 
and the visibility drops. In contrast, when AcoTg = 
27t (Fig. 4A, red data), the interferometer visibility 
is always high because the two wave packets are 
not orthogonal whether they are clocks or states 
with a definite mp. 

Our realization of clock interferometry dem- 
onstrates a way to probe the interplay between 
quantum mechanics and general relativity. In this 
context, some even suggest that wave-function 
collapse maybe due to gravity (20-22). It remains 
to be seen whether clock interferometers can 
provide new insight regarding such mechanisms 
as well. In addition, because time is considered 
by some a parameter that is still far from being 
fully understood (23), such an interferometer may 
shed new light on a variety of related fundamen- 
tal questions. 
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ELECTROCHEMISTRY 

Covalent organic frameworks 
comprising cobalt porphyrins for 
catalytic CO2 reduction in water 

Song Lin/’^* Christian S. Diercks/’^* Yue-Biao Zhang/’^’^* Nikolay Kornienko/ 

Eva M. Nichols/’^ Yingbo Zhao/ Aubrey R. Paris/ Dohyung Kim/ Peidong Yang/’^’^’® 
Omar M. Yaghi/’^’®’^| Christopher J. Chang^’^’®’®f 

Conversion of carbon dioxide (CO 2 ) to carbon monoxide (CO) and other value-added 
carbon products is an important challenge for clean energy research. Here we report 
modular optimization of covalent organic frameworks (COFs), in which the building units 
are cobalt porphyrin catalysts linked by organic struts through imine bonds, to prepare a 
catalytic material for aqueous electrochemical reduction of CO 2 to CO. The catalysts 
exhibit high Faradaic efficiency (90%) and turnover numbers (up to 290,000, with initial 
turnover frequency of 9400 hour“^) at pH 7 with an overpotential of -0.55 volts, equivalent 
to a 26-fold improvement in activity compared with the molecular cobalt complex, with no 
degradation over 24 hours. X-ray absorption data reveal the influence of the COF 
environment on the electronic structure of the catalytic cobalt centers. 



G lobal energy demands and climate change 
underpin broad interest in the sustainable 
reductive transformation of carbon diox- 
ide (CO 2 ) to value-added carbon products 
such as carbon monoxide (CO) (1, 2). Elec- 
trolytic approaches benefit from using water as 
the reaction medium, as it is a cheap, abundant, 
and environmentally benign solvent that facili- 
tates proton and electron transfer (3, 4). How- 
ever, the competitive and often kinetically favored 
off-pathway reduction of water itself to hydrogen 
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must be avoided. In this context, molecular cata- 
lysts for electrochemical CO 2 conversions can be 
systematically tuned to achieve high activity and 
selectivity over proton reduction (5-13), but they 
typically require organic media to achieve optimal 
selectivity and/or stability, often to maximize sol- 
ubility and minimize water- or proton-induced 
catalyst degradation and/or hydrogen produc- 
tion. In contrast, heterogeneous catalysts are 
often stable in water, but optimizing their activity 
through structural changes at a molecular level 
remains a substantial challenge (14-19). Against 
this backdrop, we sought to investigate crystal- 
line porous frameworks— specifically, covalent 
organic frameworks (COFs) (20-22)— diS tunable 
materials for electrocatalysis. We reasoned that 
such materials could potentially combine advan- 
tages of both molecular and heterogeneous cat- 
alysts: (i) Construction with molecular building 
blocks would enable precise manipulation of the 
spatial arrangement of catalytic centers within 
the predetermined COF structure (23)', (ii) the 
frameworks could be expanded and functional- 
ized without changing the underlying topology 
of the structure (24, 25)', and (iii) the conserved 
pore environment around the active sites within 
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the COF could be tuned electronically and steri- 
cally (26) while providing ready access for the 
substrate (27-32) (Fig. 1). Moreover, these crys- 
talline porous frameworks offer the possibility to 
perform multivariate synthesis, in which topo- 
logically identical and yet functionally modified 
building blocks can be introduced into the struc- 
ture. This approach can potentially give rise to 
materials with emergent properties that are 
greater than the sum of the individual molecular 
parts, because one can predictably prepare a 
topologically ordered framework yet introduce 
heterogeneity in the number and ratio of func- 
tionalities by the choice of building blocks (33). 
Here we show that incorporation of catalytic 
cobalt porphyrin (34) units into COFs, along 
with multivariate synthesis of frameworks bearing 
catalytic cobalt and structural copper units, gives 
highly active, stable, and selective catalysts for 
electrochemical reduction of carbon dioxide to 
carbon monoxide in water. A member of the COF 
series that we studied exhibits a 26-fold increase 
in activity compared with the parent molecular 
precursor and, in many respects, outperforms 
state-of-the-art molecular and solid-state catalysts, 
with broad opportunities for further improve- 
ment through modular synthesis using appro- 
priate combinations of building units. X-ray 
absorption measurements reveal that the COF 
framework can directly influence the electronic 
structure of the catalytic cobalt centers, in a man- 
ner akin to redox noninnocent ligand behavior 
observed in molecular systems (35), thereby con- 
tributing to the observed gains in reaction se- 
lectivity and activity beyond the steric effects of 
surface area and site isolation. 

We focused our initial electrocatalysis studies 
on COFs, as we sought to exploit the charge- 
carrier mobility of these materials derived from n 
conjugation and n-n stacking (22, 36-38), as well 
as the stability from reticular assembly with strong 
covalent bonds. We synthesized a model framework 
(COF-366-Co) by the imine condensation of 5,10, 
15,20-tetrakis(4-aminophenyl)porphinato]cobalt 
[Co(TAP)] with 1,4-benzenedicarboxaldehyde (BDA) 
(Fig. 1). The porous COF material was evacuated 
by activation with supercritical carbon dioxide 
and heating to 100°C for 18 hours. The re- 
tention of cobalt in the coordinating porphyrin 
units within the framework was confirmed by 
elemental analysis (supplementary materials sec- 
tion S1.1), thermogravimetric analysis (fig. SI), and 
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solid-state ultraviolet-visible (UV-Vis) spectros- 
copy (fig. S3). The formation of the imine link- 
ages in the COF was confirmed by attenuated 
total refiectance-Fourier transform infrared spec- 
troscopy, which showed the characteristic imine 
stretching vibration band at 1621 cm"^, as well 
as the absence of the aldehyde stretching vibra- 
tion band at 1684 cm"^ (fig. SIO). The morphol- 
ogies of the activated COF samples were examined 
by scanning electron microscopy (SEM), which 
showed aggregation of only one kind of crystal- 
lite of rectangular rod-shaped morphology (~50 nm 
in length) (Fig. 2A). Powder x-ray diffraction 
(PXRD) patterns (Fig. 2B) showed intense peaks 
in the expected low-angle range, with no residual 
peaks characteristic of the starting materials. To 
elucidate the constitution of the framework, we 
used Materials Studio 7.0 to build a structural 
model (Fig. 1) in an orthorhombic Cmmm space 
group to allow the lattice distortion from regular 
square nets. Pawley refinements of the PXRD 
patterns were carried out for full profile fitting 
against the proposed models, resulting in a good 
agreement factor (weighted-profile R factor R^ = 
2.59% and unweighted-profile R factor R^ = 1.38% 
after convergence) and reasonable profile differences. 
These refinements revealed one-dimensional (ID) 
channels, 21 A in width, running along the c axis, 
with a distance of 4.4 A between the stacking 2D 
sheets. The porosity and specific surface area were 



determined using nitrogen adsorption isotherms at 
77 K (Fig. 2C). The Brunauer-Emmett-Teller (BET) 
surface area was determined to be 1360 m^/g. 
Density functional theory (DFT) fitting of the 
adsorption branches showed relatively narrow 
pore size distributions (10 to 18 A), in agreement 
v^th that of the proposed model. 

For electrochemical experiments, the activated 
microcrystalline COF powders were deposited on 
porous, conductive carbon fabric. Cyclic voltam- 
metry (CV) measurements on COF-366-Co were 
conducted in pH 7 aqueous phosphate buffer 
(Fig. 3A). Under a nitrogen atmosphere, the 
voltammogram trace shows a departure from the 
electrode background at about -0.45 V versus a 
reversible hydrogen electrode (RHE). A broad re- 
duction wave, which was assigned to the Co(II)/ 
Co(I) redox couple, is also apparent at about -0.67 V. 
The surface concentration of electrochemicaHy ac- 
tive cobalt porphyrin sites was measured by integra- 
tion of the reduction wave to be 1 x lO"^ mol/cm^, 
corresponding to activity at 4% of the cobalt por- 
phyrin sites in the material. The observed contin- 
uous current increase at potentials more negative 
than -0.67 V likely arises from Co(I)-catalyzed pro- 
ton reduction activity (39). After the solution was 
saturated v^th carbon dioxide, a substantial cur- 
rent enhancement was observed [catalytic current/ 
noncatalytic current ratio (W^) = 1-8 at -0.67 V] 
(40), with a catalytic onset potential at -0.42 V, 



indicating a prominent catalytic effect of COF-366- 
Co on the reduction of carbon dioxide in neutral 
aqueous solution. In contrast, the carbon fabric 
electrode alone showed minimal current enhance- 
ment under a carbon dioxide or nitrogen atmo- 
sphere (fig. S56). 

In controlled potential electrolyses performed 
in carbon dioxide-saturated aqueous bicarbonate 
buffer (pH = 7.3) under applied potentials be- 
tween -0.57 and -0.97 V (versus RHE), carbon 
monoxide was observed as the major reduction 
product (Fig. 3B), with no other detectable carbon- 
based products. At -0.67 V, which represents an 
overpotential of -0.55 V, the catalyst displayed 
optimal performance (figs. S63 and S64). More 
positive potentials led to sluggish carbon dioxide 
reactivity, whereas more negative potentials pro- 
moted off-pathway water reduction. At -0.67 V, 
COF-366-Co promoted carbon monoxide evolu- 
tion at an initial current density of 5 mA per mil- 
ligram of catalyst (~80 mA per milligram of 
cobalt), with high selectivity over competing 
proton reduction [Faradaic efficiency for carbon 
monoxide (FEco) = 90%]. Catalytic cobalt por- 
phyrin centers in the COF showed greater than 
10% enhancement in carbon dioxide to proton 
selectivity over the molecular cobalt porphyrin unit 
alone (Fig. 3B). The catalytic activity of the COF 
could be maintained for 24 hours, accumulating 
more than 36 ml of carbon monoxide [standard 
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Fig. 1. Design and synthesis of 
metalloporphyrin-derived 2D 
covalent organic frameworks. 

The space-filling structural 
models of COF-366-M and 
COF-367-M were obtained using 
Materials Studio 7.0 and 
refined with experimental 
PXRD data. 
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temperature and pressure (STP), equivalent to 
1.6 mmol] per milligram of COF. This corresponds 
to a turnover number (TON) of 1352 [TON per 
electroactive cobalt based on the measurements 
described above (TONea) ~ 34,000], with an initial 
turnover frequency (TOF) of 98 hour"^ [TOF per 
electroactive cobalt (TOFea) ~ 2500]. This result 
represents a substantial improvement over the 
molecular Co(TAP), which showed an initial TOF 
of 36 hour"^ (TOFea ~ 360) and a TON of 794 
(TONea ~ 8300) after 24 hours of reaction [sur- 
face coverage of electroactive sites on a Co(TAP)- 
modified electrode constitutes -10% of the total 
cobalt]. PXRD (fig. S73) and SEM (fig. S74) mea- 
surements on the COF catalyst after electrolysis 
showed neither marked changes in crystallinity 
and microscopic morphology nor evidence of co- 
balt nanoparticle formation. By incorporating 
this molecular unit into a COF scaffold, the re- 
sulting hybrid catalyst could be recovered from 
the electrolysis mixture and reused at least five 
times without decrease in activity and selectivity 
(table S8). Control experiments with COFs contain- 
ing either free-base or copper-metallated porphy- 
rin units, a physical mixture of independently 
synthesized cobalt nanoparticles (fig. S75) and 
COF, and C03O4 and related cobalt salts that 
could be formed as impurities during COF syn- 
thesis did not produce appreciable carbon dioxide 
reduction (fig. S76). 

To optimize this carbon dioxide catalyst plat- 
form by a modular reticular approach, we prepared 
the expanded COF-367-Co analog using biphenyl- 



4,4 -dicarboxaldehyde (BPDA) as the strut in place 
of BDA (Fig. 1). We reasoned that a larger pore 
size would allow for higher capacity of carbon 
dioxide adsorption inside the framework, as well 
as higher electrochemical and chemical accessi- 
bility of the catalytic cobalt porphyrin active sites. 
COF-367-Co was obtained as a dark purple pow- 
der comprising rectangular rod-shaped crystal- 
lite aggregates (-100 nm in length) (Fig. 2D). The 
structural model based on PXRD data and DFT 
fitting indicated that the constitution and topol- 
ogy of COF-367-Co is analogous to that of COF- 
366-Co, with the former showing an increased 
channel width (24 A) and interlayer distance 
(4.8 A) (Fig. 2E). The BET surface area was de- 
termined by nitrogen adsorption isotherm (Fig. 
2F) as 1470 m^/g, and pore size distribution was 
consistent with the model (12 to 23 A). Cyclic 
voltammetry studies with COF-367-Co on a car- 
bon fabric electrode in bicarbonate buffer in- 
dicated that this extended organic framework 
exhibits an improved surface concentration (2 x 
10"^ mol/cm^) of electroactive cobalt porphyrin 
sites over COF-366-Co, which corresponds to ac- 
cessibility of 8% of the cobalt sites in the bulk 
material. When the solution was saturated with 
CO2, a catalytic current was evident with an on- 
set potential at -0.40 V and a 2.2-fold enhance- 
ment at -0.67 V, indicating a prominent effect of 
COF-367-Co in the catalysis of CO2 reduction at 
these potentials (Fig. 3A). As predicted, electrol- 
ysis under the same conditions described above 
revealed that this expanded COF displays im- 



proved catalytic efficiency as compared with 
COF-366-Co. At an applied potential of -0.67 V, 
COF-367-Co produced more than 100 ml of carbon 
monoxide (STP, equivalent to 4.5 mmol) per mil- 
ligram of COF during a 24-hour period (TON = 
3901, TONea ~ 48,000), with high Faradaic effi- 
ciency (91%). The increased TONea over COF-366- 
Co indicated that lattice expansion allowed for 
more efficient exposure of the electroactive sites 
to the reactants. 

In addition to framework expansion, we also 
sought to optimize catalyst performance by intro- 
ducing building-block heterogeneity through a 
multivariate strategy (33), as we hypothesized 
that not all electroactive cobalt porphyrin sites in 
the material fully participated in electrocatalysis, 
owing to the low aqueous solubility of carbon 
dioxide. Specifically, we reasoned that using iso- 
structural metalloporphyrin units that are cata- 
lytically inactive for carbon dioxide reduction 
(fig. S76) to dilute electroactive cobalt porphyrin 
active sites within the extended lattice could in- 
crease the proportion of the active sites exposed 
to the reactant and thereby improve the turnover 
frequency on a per-cobalt basis. Copper porphy- 
rin was chosen for this study, as the correspond- 
ing COF-367-Cu had the highest surface area 
among all analogous COF-367-M (M = Ni, Zn, H2) 
synthesized. The resulting bimetallic COF-367 
derivatives, termed COF-367-Co(10%) and COF- 
367-Co(l%), were prepared; the numbers in the 
parentheses indicate the proportion of cobalt in 
all metal sites (Fig. 1). Inductively coupled plasma 







Fig. 2. Structural characterization of COF-366-Co 



2 0 n 



P/Po 



and COF-367-Co. (A) SEM image of COF-366-Co. (B) Experimental (red line) and simulated (blue line) PXRD patterns of COF-366-Co. a.u., arbitrary units. 
(C) Nitrogen adsorption isotherm for COF-366-Co. P, pressure of the system: Pq, saturation pressure. (D) SEM image of COF-367-Co. (E) Experimental (red 



line) and simulated (blue line) PXRD patterns of COF-367-Co. (F) Nitrogen adsorption isotherm for COF-367-Co. 
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(ICP) analyses confirmed that the final chemical 
composition of the COFs obtained reflected the 
initial ratio of the two metalloporphyrin start- 
ing materials that we used (supplementary ma- 
terials section Sl.l). The TOF per electroactive 
cobalt site for carbon monoxide production in 
these multivariate Co/Cu COF-367 catalysts 
showed a substantial improvement with each 
10-fold dilution of cobalt loading. The average 



TOF per active cobalt for the first 4 hours of 
reaction was measured as follows: COF-367-Co, 
165 hour'^ (TOFea = 1900); COF-367-Co(10%), 
360 hour”' (TOFea “ 4400); and COF-367-Co(l%), 
764 hour"' (TOFea = 9400) (Fig. 3C). The TOFea 
values were roughly estimated, assuming that all 
three frameworks had comparable percentages 
of electroactive sites (8%). Although the low 
cobalt concentration in COF-367-Co(10%) and 



COF-367-Co(l%) hampered the accurate deter- 
mination of the surface concentration of elec- 
trochemically accessible cobalt sites by CV, the 
bimetallic frameworks had nearly identical pore 
sizes, interlayer distances, and surface areas to 
the parent COF-367-Co on the basis of PXRD 
and nitrogen adsorption measurements (figs. S25 
and S32 to S43), which suggests that such an 
assumption may be valid. Owing to the moderate 








Fig. 3. Electrochemical and spectroelectro- 
chemical characterizations and carbon dioxide 
reduction performance of the COFs. (A) Cyclic 
voltammograms of COF-366-Co and COF-367-Co 
in carbon dioxide-saturated medium (blue and red 
solid lines, respectively) or nitrogen-saturated me- 
dium (blue and red dotted lines, respectively). The 
black solid line shows background (bare carbon 
electrode) CV responses in carbon dioxide-saturated 
medium. The medium was pH 7.2 aqueous po- 
tassium phosphate buffer (0.2 M) with additives: 

0.5 M KHCO3 under carbon dioxide atmosphere to 
maintain a neutral pH, or 0.5 M NaCI 04 under nitro- 
gen atmosphere to match the ionic strength. (B) 

Long-term bulk electrolyses at -0.67 V (versus RHE), showing the volume of 
carbon monoxide produced by COF-367-Co (red solid triangles), COF-366-Co 
(blue solid circles), or Co(TAP) (black solid squares) and the volume of hy- 
drogen produced by COF-367-Co (red open triangles), COF-366-Co (blue open 
circles), or Co(TAP) (black open squares). (C) Bulk electrolyses of bimetallic 
COFs at -0.67 V (versus RHE), showing TON of carbon monoxide production 
by COF-367-Co(l%) (red solid triangles), COF-367-Co(10%) (blue solid circles), 
COF-367-Co (orange solid squares), or COF-367-Cu (purple solid inverted tri- 
angle: TON with respect to the amount of copper porphyrin). Two separate ex- 
perimental runs were conducted for each time point, with the line showing the 
average of the measurements. (D) Long-term bulk electrolysis of COF-367-Co(l%) 
at -0.67 V (versus RHE). (E) Tafel plots of electrolysis using COF-367-Co (red 
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diamonds), showing only the linear, low-current density regime where the re- 
action rate is not limited by mass transport. (F) X-ray absorption spectra of COF- 
367-Co (orange line), COF-367-Co(10%) (blue line), and Co(TAP) (black line). 
The inset shows the pre-edge regime of the same spectra. (G) Spectroelec- 
trochemical data using in situ UV-Vis, showing the steady-state relative absorb- 
ance at different applied potentials (0.23 to -0.57 V versus RHE) with reference to 
that at 0.33 V. (H) Time dependence of the relative UV-Vis absorbance at 640-nm 
wavelength at -0.57 V, with respect to the steady-state UV-Vis absorbance at 
640 nm at 0.33 V. Potential is applied at 4 s. Linear regression yielded the 
apparent diffusion coefficient (Dgpp) according to a modified Cottrell equation: 

AAbsorption = — (fig. S81) (47). Here, Amax is the maximum absorption 



solid circles), COF-366-Co (blue solid triangles), or Co(TAP) (black solid of the thin films at 640 nm, cf is the overall thickness of the thin films, and t is time. 
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proton reduction ability of the copper porphyrin 
sites in the hybrid organic framework, the TOF 
increased as the Co/Cu ratio decreased, at the 
expense of Faradaic efficiency for carbon mon- 
oxide production [FEoo = 70% for COF-367-Co(10%) 
and 40% for COF-367-Co(l%)]. In a long-term 
electrolysis experiment (Fig. 3D), COF-367-Co 
(1%) displayed a TON of > 24,000 (TONea ~ 
296,000), which indicates that it is one of the 
most efficient electrochemical carbon dioxide 
reduction catalysts reported to date (3, 4). The 
observed activity enhancements for the bime- 
tallic COFs are due to their multivariate nature; 
indeed, the analogous physical mixture of COF- 
367-Co and COF-367-Cu in a 1:9 ratio produced 
far less carbon dioxide reduction activity com- 
pared with COF-367-Co(10%) (table S17). 

In addition to challenges posed by the low 
aqueous solubility and limited diffusion of CO2 
within the COF, only a small portion (4 to 8%) of 
the cobalt centers in the COF material deposited 
in this manner proved to be electroactive, presum- 
ably because of the limited electrochemical con- 
tact between the COF powder and the electrode 
(41). As such, we explored the possibility of grow- 
ing COF directly onto the surface of an electrode 
substrate in the form of oriented thin films (42-44). 
Layers of COF sheets could be successfully syn- 
thesized directly on glassy carbon, fluorine-doped 
tin oxide (FTO), and silicon oxide. Grazing-incidence 
wide-angle x-ray scattering (GIWAXS) patterns 
showed the formation of highly crystalline COF 
thin films (figs. S48 to S51) (45). With the use of 
ICP, the concentration of COF on the substrate 
surface was determined to be 1.3 x lO"^ mmol Co 
per square centimeter, corresponding to a thick- 
ness of -350 nm, which is consistent with images 
obtained by cross-sectional SEM (figs. S54 and 
S55). The charge transport through the COF-366- 
Co thin films was characterized using spectro- 
electrochemistry. Under an applied potential 
more negative than -0.37 V in nitrogen- or car- 
bon dioxide-saturated pH 7 aqueous buffer, the 
UV-Vis spectrum of the FTO-supported COF 
underwent changes attributable to Co(II)/Co(I) 
reduction (Fig. 3G and figs. S78 to S82). Using 
the steady-state spectroscopic response to the 
reduction potentials in combination with the 
Nemst equation, the redox potential (E'1/2) could 
be estimated to be more negative than -0.52 V 
(fig. S83), in agreement with the CV measure- 
ments (Ey 2 ~ -0.67 V) (46). The time-dependence 
of the UV-Vis response of a COF-coated FTO 
electrode to an applied potential of -0.57 V was 
studied in a C02-saturated KHCO3 solution (Fig. 
3G), and a fit of the data to a modified Cottrell 
equation afforded an apparent diffusion coeffi- 
cient of 2 X 10"^^ cm^/s (fig. S81), a value substan- 
tially higher than that obtained with analogous 
metal-organic framework (MOF) thin films bear- 
ing cobalt porphyrin units (47). Using the silicon 
oxide-hosted sample, we measured the direct cur- 
rent conductivity of COF-366-Co to be -10“® S/cm, 
which is also higher than that of the MOF (figs. S84 
and S85). In addition to a presumably increased 
portion of electroactive cobalt sites, the favorable 
chaise-transport properties led to a higher cata- 



lytic efficiency. Under the same electrolysis con- 
ditions, COF-366-Co thin films on glassy carbon 
exhibited a TOF of 665 hour“\ a value seven times 
as high as that of the same material deposited 
on a carbon fabric, with a high current density of 
45 mA/mg and a Faradaic efficiency of 86%. 

Electrokinetics experiments indicated that cova- 
lently linking cobalt porphyrin active sites within 
a COF influences the mechanistic pathways for 
carbon dioxide reduction. For the systems em- 
ploying COF-366-Co, COF-367-Co, and COF-367- 
Co(10%), Tafel plots of the logarithm of current 
density [log(/co)] versus the overpotential (q) 
display comparable slopes between 470 and 
550 mV/dec (where 1 dec is one order of mag- 
nitude) in the linear, lower-current density regime 
(-0.57 to -0.87 V). This result differs considerably 
from what is observed for the molecular Co(TAP) 
analog (270 mV/dec) (Fig. 3E and figs. S65 to S68). 
This difference suggests that, when embedded in 
a structurally and electronically extended frame- 
work, cobalt porphyrin catalyzes carbon dioxide 
reduction by a different mechanism than that 
which pertains with the simple molecular an- 
alog. In addition, both the COF and Co(TAP) 
catalysts showed a first-order rate dependence 
on the partial pressure of carbon dioxide (fig. 
S87) and no rate dependence on the pH (figs. S88 
and S89), indicating that the rate-limiting chem- 
ical step involves the participation of one mole- 
cule of carbon dioxide and no proton. 

Finally, we applied x-ray absorption spectros- 
copy (XAS) as a probe to evaluate how the sur- 
rounding COF influenced the electronic structures 
of incorporated catalytic cobalt porphyrin units 
and, in turn, the associated carbon dioxide re- 
duction mechanism (48, 49). Comparison of the 
cobalt K-edge XAS spectra of Co(TAP), COF-367- 
Co, and COF-367-Co(10%) (Fig. 3F) reveals a sim- 
ilar line size, shape, and position, consistent with 
a formal Co(II) oxidation state for all samples. 
When a reduction potential (-0.67 V) was applied 
to COF-367-Co under a carbon dioxide atmo- 
sphere, the K-edge revealed a line-shape change 
consistent with the partial reduction of Co(II) 
to Co(I) (fig. S96). The two COF catalysts also 
exhibited an additional pre-edge feature not 
present in the molecular Co(TAP) analog (Fig. 
3F, inset); this difference shows that the COF 
environment could directly modulate the elec- 
tronic properties of molecular centers coupled 
into the extended lattice (50, 51). This feature, 
unique to the COFs, was also observed when the 
catalysts were subjected to open-circuit or elec- 
trolysis (-0.67 V) potentials in a bicarbonate 
buffer (figs. S92 to S95). The communication 
between the metal and COF lattice could furnish 
a cobalt center with a more delocalized electronic 
structure, a network-solid analog to noninnocent 
ligand behavior in molecular systems (35). 

Incorporating tunable molecular units within 
an extended COF thus gives rise to electrocata- 
lysts that display advantageous features of both 
molecular and heterogeneous systems and promote 
carbon dioxide reduction to carbon monoxide 
with exceptionally high activity and selectivity 
compared with existing systems (table S21). We 



anticipate that these hybrid molecular-material 
platforms will be applicable to a broad range of 
catalytic applications, particularly those that re- 
quire sustainable electrical and/or solar input 
and aqueous compatibility. 
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GEOPHYSICS 

Stress orientations in subduction 
zones and the strength of suhduction 
megathrust faults 

Jeanne L. Hardebeck 

Subduction zone megathrust faults produce most of the world’s largest earthquakes. 
Although the physical properties of these faults are difficult to observe directly, their 
frictional strength can be estimated indirectly by constraining the orientations of 
the stresses that act on them. A global investigation of stress orientations in subduction 
zones finds that the maximum compressive stress axis plunges systematically trenchward, 
consistently making an angle of 45° to 60° with respect to the subduction megathrust 
fault. These angles indicate that the megathrust fault is not substantially weaker than its 
surroundings. Together with several other lines of evidence, this implies that subduction 
zone megathrusts are weak faults in a low-stress environment. The deforming outer 
accretionary wedge may decouple the stress state along the megathrust from the 
constraints of the free surface. 



S ubduction zones pose a considerable hazard 
as the main source of great earthquakes 
(magnitude >8). Relative to other fault 
types, subduction zone megathrust faults 
may also have unique physical properties, 
due to the sediments and fluids that are en- 
trained by the subducting oceanic plate ( 1 ). The 
presence of fluids, particularly at high fluid 
pressures, can weaken faults substantially ( 2 ). 
Direct evidence suggests that subduction zone 
megathrust faults slip with low frictional resist- 
ance near Earth’s surface ( 3 ). The strength of 
these faults at seismogenic depths is less clear, 
as they can only be studied indirectly. 

The orientation of a fault with respect to the 
stress field can be an indicator of the fault’s 
strength. Faults optimally oriented for frictional 
failure make an angle of -30° to the maximum 
compressive stress axis, Oi. Nonoptimally ori- 
ented faults can also be active, but faults with 
typical frictional strength should not slip at a 
high angle to Oi (>60°) (4). Faults operating at 
a high angle to Oi must be very weak compared 
to the surrounding material to slip at relatively 
low applied shear stress. Similarly, faults ori- 
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ented at a very low angle to Oi (<10°) must also 
be weak. 

The traction at Earth’s surface is zero, requir- 
ing an Andersonian stress state in which one 
principal stress axis is vertical (5). Most crustal 
faulting is consistent with an Andersonian stress 
state ( 6 ). The very shallow dip (-10°) of the upper 
portion of many subduction zone megathrusts 
(7) suggests that, if these faults exist within an 
Andersonian stress state, they operate at a high 
angle to one principal stress axis (and at a low 
angle to the other two) and are therefore fric- 
tionally weak. However, observations from some 
subduction zone locations indicate that the stress 
in these locations is not Andersonian ( 8 - 11 ). 

I systematically investigate the stress orien- 
tations in subduction zones worldwide, to de- 
termine whether the stress state is generally 
non-Andersonian and whether megathrust faults 
are well-oriented for failure. I compile the mo- 
ment tensors ( 12 , 13 ) of earthquakes located 
within 20 km of the subduction zone interface 
(7) to represent the megathrust region, and 
more shallow events to represent the upper 
plate. I stack the events for each subduction 
zone to invert for stress orientation ( 14 ) as a 
one-dimensional function of subduction inter- 
face depth. For the several subduction zones 



near Japan, there are enough moment tensors 
to invert for the two-dimensional spatially vary- 
ing stress field as well. 

The stress state in the subduction zones near 
Japan is generally not Andersonian. The Oi axis 
in most megathrust regions plunges systemat- 
ically trenchward (opposite the direction of sub- 
duction zone dip), generally plunging between 
10° and 50° (Fig. lA). A similar plunge of the 

axis is observed within the upper plate above 
the Japan Trench, Kuril Trench, and Nankai 
Trough (Fig. IG), whereas back-arc extension 
(near-vertical Ci) dominates above the Ryukyu 
and Izu-Bonin trenches. 

Comparing the plunge of the Oi axis in the 
megathrust region to the dip of the subduction 
interface (7) gives the angle of Oi to the megathrust 
fault (Fig. IB). For the Kuril, Japan, and Izu-Bonin 
trenches, the Oi axis is typically at an angle of 20° to 
50° to the fault. For the Ryukyu Trench, the axis 
makes somewhat larger angles of 30° to 80° to the 
fault. In the Nankai Trough, oi is oriented at 10° to 
30° to the fault in the upper 20 km, while at deeper 
depths, the Oi axis is near-vertical. The plunge of 
the oi axis is stable with depth (Fig. 1C), except in 
the Nankai Trough (Fig. ID), and the subduction 
megathrust is generally well oriented for failure 
from the near-surface to at least 60 km depth 
(Fig. IE). The stacked Kuril, Japan, and Izu- 
Bonin trenches (the subducting Pacific plate) show 
a megathrust fault that is optimally oriented for 
failure, at -30° to oi, over the full depth range. 
The megathrust at the Ryukyu Trench is gen- 
erally well oriented for failure at <60° down to 
30 km depth, and is marginally oriented at -60° 
from 30 km to 60 km depth. The exception is the 
Nankai Trough (Fig. IF), which is well oriented at 
-20° to Oi only in the upper 20 km. Below 30 km, 
the megathrust is not oriented for reverse faulting 
within the error bounds. This suggests that 30 km 
is the downward limit of coupling between the 
subducting and overriding plates. 

For all global subduction zones studied, for al- 
most all depths, the axis plunges systematically 
trenchward, with plunge generally between 10° 
and 50° (Fig. 2A). The plunge tends to decrease with 
depth, while the subduction zone dip increases (7), 
so that the angle of Oi to the subduction interface 
is remarkably stable with depth (Fig. 2B). This 
angle is also remarkably similar across subduction 
zones, most falling within the range 45° to 60°. 

The Japan and South American subduction 
zones are anomalous in being more optimally 
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oriented at 20° to 40° as is the Nankai Trough 
in the upper 20 km. The Marianas Trench, sim- 
ilar to the Ryukyu Trench, is at a somewhat 
high angle at 20 to 40 km depth, although in 
both cases angles of <60° are within the error 
bounds. The Mexico Trench is the only case in 
which the megathrust is unambiguously at high 
angle, although only below 40 km, implying 
that this megathrust may be unusually weak at 
depth. These few anomalous stress orientations 
are not correlated with the age of the subduct- 
ing plate (15) or the trench sediment thickness 



(16). The subduction zones at low angle (Japan, 
Nankai, South America) all exhibit strong seis- 
mic coupling (17). However, subduction zones at 
moderate to high angles exhibit a wide range of 
seismic coupling, so there is no clear correlation. 

The most-vertical principal stress axis, ay, 
must be vertical at Earth’s surface. However, 
the systematic plunge of the axis implies that 
Gv is generally not vertical. I invert for stress ori- 
entation the earthquakes near the main Japa- 
nese islands, separated into megathrust and 
upper plate regions and binned by hypocentral 



depth (14), to investigate whether oy becomes 
closer to vertical near the free surface. I include 
the aftershock sequence of the 2011 Tohoku earth- 
quake, which contains many shallow events in 
the megathrust region. The aftershocks must 
be inverted separately because of the impact of 
the Tohoku earthquake on the local stress field 
(8, 9) and on the distribution of earthquakes in 
the upper plate (18). The megathrust zone shows 
no rotation of Gy toward vertical at shallower 
depths near the trench, either before or after the 
Tohoku earthquake (Fig. 3, A and B). The upper 
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Fig. 1. Orientation of the maximum compressive stress axis, gi, for sub- 
duction zones near Japan. (A) Spatial distribution of the plunge of oi near 
the megathrust, positive if gi plunges trenchward (i.e., opposite to the direction 
of subduction zone dip). (B) Spatial distribution of the angle of oi to the sub- 
duction zone interface, positive to drive reverse slip. (C) The plunge of gi near 
the megathrust as a function of subduction zone depth, shown separately for 
the subducting Pacific Plate (Kuril, Japan, and Izu-Bonin trenches) and the 
Ryukyu Trench. Dashed lines indicate 95% confidence range. (D) The plunge of 



Gi near the megathrust for the Nankai Trough. (E) The angle of ci to the sub- 
duction zone interface, as a function of subduction zone depth, for the sub- 
ducting Pacific Plate and the Ryukyu Trench. Dashed lines indicate 95% 
confidence range. Light shading indicates high or low angles implying a poorly 
oriented fault: dark shading indicates angles inconsistent with reverse slip. 
(F) The angle of gi to the subduction zone interface for the Nankai Trough. 
Shading as in (E). (G) Spatial distribution of the plunge of ci in the upper 
plate, positive if oi plunges trenchward. 
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plate, however, demonstrates a clear rotation of 
ov toward vertical with decreasing depth, over 
the whole depth range considered (Fig. 3, C and 
D). The upper plate appears to see the free 
surface at depth, but the megathrust zone does 
not. The difference maybe due to the deforming 
outer accretionary wedge (19). The outer wedge 
permanently deforms in response to the applied 
stresses, including any difference in traction 
between the free surface and the base of the 
wedge, which may decouple the stress along the 
megathrust at greater depths from the con- 
straints of the free surface. 

The angle of to the subduction interface 
indicates the frictional strength of the mega- 
thrust fault. The observed optimal to moderate 
angles imply that the megathrust frictional 
strength is similar to that of its surroundings. 
These angles are inconsistent with models of a 
relatively weak megathrust in a strong crust (20). 
The observed stability of the angle of Oi to the 
subduction interface suggests that the coeffi- 
cient of friction of the megathrust is similar in 
most subduction zones worldwide and does not 
vary substantially from the near-surface to at 
least 60 km depth. The strength of a fault can be 
estimated from the angle of Oi to the fault, using 
a Mohr circle construction (21), if assumptions 
are made about the strength of the surrounding 
material. If the region around the megathrust is 
assumed to be strong and critically stressed (all 
other faults have a typical friction coefficient of 
p = 0.6, and pore pressure is hydrostatic), the 
megathrust coefficient of friction is implied to 
be p = 0.35 to 0.6. If pore pressure is elevated only 
within the megathrust fault zone, the required 
megathrust pore pressure would be 0.3 to 0.6 of 
lithostatic pressure. 

Several lines of evidence suggest that the 
megathrust is weaker than the above assump- 
tions imply. A coefficient of friction of p = 0.08 
is estimated from the heat flow observed at 
shallow depths after the 2011 Tohoku earthquake 
(3), and the coefficient of friction inferred from 
heat flow for subduction zones worldwide is 
generally p < 0.1 (22). Seismological imaging of 
fluids in subduction zones implies nearly litho- 
static pore pressure at greater depths (23, 24). 
Rotations of the stress field due to large sub- 
duction zone earthquakes imply low deviatoric 
stress levels (8, 9). A more likely model, there- 
fore, is that the entire region is at low deviatoric 
stress. In this model, all faults, including the 
megathrust, are weak and support only low shear 
stress. A similar model has been proposed for 
the San Andreas continental transform plate 
boundary (25), despite the very different tec- 
tonic setting. 

If the entire subduction zone region is at low 
strength, this implies that high fluid pressures 
or low friction coefficients are pervasive. We 
rule out a model of a broad area of high fluid 
pressure as seismic observations limit them to 
a narrow plate interface (26). Therefore, other 
active faults in the subduction zone region must 
be weakened as a result of narrow zones of high 
fluid pressure or low friction coefficient. The 
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Fig. 2. Orientation of the maximum compressive stress axis, cri, as a function of depth for sub- 
duction zones worldwide. (A) The plunge of ci near the megathrust, positive if oi plunges trenchward, 
as a function of subduction zone depth. Line colors and styles correspond to subduction zones as shown 
in the key: results from Japan are reproduced in black. The 95% confidence (not shown) is on average 
±8°. (B) The angle of cti to the subduction zone interface as a function of subduction zone depth. Light 
shading indicates high or low angles implying a poorly oriented fault: dark shading indicates angles 
inconsistent with reverse slip. (C) Map with the subduction zones used indicated in red, excluding Japan. 
Numbers correspond to those in the key. 
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Fig. 3. Orientation of the most-vertical principal stress axis, <ry, for the main islands of Japan as a 
function of earthquake hypocentral depth. Error bars indicate 95% confidence range. Color indicates 
which stress axis is most vertical. Depth is relative to sea level. (A) The plunge of oy in the Japan Trench 
megathrust region prior to the 2011 M9 Tohoku earthquake. (B) The plunge of ov in the upper plate of the 
main islands of Japan prior to the Tohoku earthquake. (C) The plunge of oy in the Japan Trench megathrust 
region for the 3 months following the Tohoku earthquake. (D) The plunge of oy in the upper plate of the 
main islands of Japan for the 3 months following the Tohoku earthquake. 
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Stability of the observed stress orientations 
with depth implies that there are no large down- 
dip changes in the relative strengths of the mega- 
thrust and other faults, despite substantial changes 
in temperature and pressure. This suggests that 
all faults in the subduction zone region have 
similar physical mechanisms controlling their 
strength. 
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NEURONAL IDENTITY 

Tuning of fast-spiking interneuron 
properties by an activity-dependent 
transcriptional switch 

Nathalie Dehorter,^’^ Gabriele Ciceri,^* * Giorgia Bartolini/’^ Lynette 
Isabel del Pino/’^f Oscar Marin^’^t 

The function of neural circuits depends on the generation of specific classes of neurons. 
Neural identity is typically established near the time when neurons exit the cell cycle to 
become postmitotic cells, and it is generally accepted that, once the identity of a neuron has 
been established, its fate is maintained throughout life. Here, we show that network 
activity dynamically modulates the properties of fast-spiking (FS) interneurons through 
the postmitotic expression of the transcriptional regulator ErSl. In the adult cortex, ErSl 
protein levels define a spectrum of FS basket cells with different properties, whose relative 
proportions are, however, continuously adjusted in response to neuronal activity. Our 
findings therefore suggest that interneuron properties are malleable in the adult cortex, 
at least to a certain extent. 



F ast-spiking (FS), parvalbumin-expressing 
(P\^) basket cells make up the most abun- 
dant population of GABAergic interneu- 
rons in the cerebral cortex. Although they 
probably represent no more than 5% of 
the total neuronal population, they contribute 
to feedback and feedforward inhibition and 
are critically involved in the generation of net- 
work oscillations that are crucial for sensory 
perception, cognition, and behavior (1-5). In 
addition, FS PV^ basket cells play a prominent 
role in the regulation of plasticity and learning 
(6-9). Although it is likely that different classes 
of FS PV^ interneurons exist, the molecular 
mechanisms regulating their properties remain 
largely unknown (10). 

We previously found that neurons generated 
in the medial ganglionic eminence (MGE) ex- 
press the transcription factor Etvl/ErSl (11). Be- 
cause PV^ interneurons derive from the MGE 
(10), we investigated whether ErSl is expressed 
in this population of interneurons. Although 
ErSl expression in the neocortex has been as- 
cribed only to layer V pyramidal cells (12), we 
observed that this transcription factor is also 
expressed in scattered cells throughout the neo- 
cortex of postnatal mice (Fig. lA). In layer II-III, 
where ErSl^ expressing cells are particularly 
abundant, the majority of these cells are PV^ 
interneurons (fig. SI). Conversely, over 60% of 
PV^ interneurons in layer II-III contain detect- 
able levels of ErSl protein (Fig. IB). 
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Because PV^ interneurons in superficial lay- 
ers of the cortex are heterogeneous (13), we 
wondered whether ErSl expression defines the 
properties of FS intemeurons. We thus performed 
whole-cell recordings in acute slices through the 
primary somatosensory cortex of PV-Cre;RCE 
mice, in which PW interneurons express an en- 
hanced form of green fluorescent protein (GFP). 
We confirmed the identity of FS PW intemeu- 
rons on the basis of well-described features of 
their action potentials (14). Different classes of 
PV^ FS interneurons have distinctive features 
in their pattern of discharge. For example, some 
classes of PW interneurons display a prominent 
delay to the first spike near spike threshold po- 
tential, whereas others do not (15, 16). The ma- 
jority of PV^ interneurons in layer II-III had a 
delayed firing pattern and expressed ErSl (n = 
22/33 cells) (Fig. 1, C and E). By contrast, PV^ 
interneurons with very short latency or purely 
nondelayed firing consistently lacked detecta- 
ble levels of ErSl protein (n = 11/33 cells) (Fig. 1, 
D and E). Thus, although the basic membrane 
properties of these cells are similar (fig. S2A), 
ErSl expression clearly segregates two classes 
of PV^ interneurons on the basis of their phys- 
iological parameters at near-threshold potential 
(Fig. 1, E to H). The correlation between ErSl 
expression and firing pattern seems universal 
among FS interneurons in the telencephalon. 
Chandelier cells, an important class of FS in- 
terneurons that exhibit nondelayed firing (17) 
(fig. S3), and the large majority of PV^ FS cells 
in layer IV, which also display nondelayed fea- 
tures, both lack ErSl expression (fig. S4, A to D). 
By contrast, PW interneurons in the striatum 
displayed a delayed firing pattern and contain 
ErSl (fig. S4, E to H). 

To extend our observations, we analyzed the 
connectivity of FS interneurons in layer II-III 
of the somatosensory cortex. Recordings of min- 
iature excitatory and inhibitory postsynaptic 
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currents (mEPSCs and mlPSCs, respectively) re- 
vealed that Er81^ FS intemeurons receive signif- 
icantly more excitatory synapses and far fewer 
inhibitory contacts than Er81" FS intemeurons 
(Fig. 1, 1 to N, and fig. S2, B to G). 

To determine whether Er81 plays an instruc- 
tive role in the specification of delay-type FS 
intemeurons, we specifically deleted F'rSi from 
the embryonic MGE by breeding Lhx6-Cre mice 
{18) with mice carrying loxP-flanked (F) Er81 
alleles {19). We observed no differences in the 
density and laminar distribution of PV^ inter- 
neurons in the cerebral cortex of control and 
Lhx6-Cre;Er8f^;RCE mice at postnatal day 30 



(P30) (Fig. 2A and fig. S5A), which indicated 
that Er81 is not required for the migration and 
early specification of FS intemeurons. However, 
most FS intemeurons in Er81 conditional mu- 
tants displayed a dramatic reduction in the mean 
latency to the first spike, hyperpolarization of 
the threshold potential for spike, and a lower 
rheobase than control FS intemeurons (Fig. 2B 
and fig. S5, B to D). 

To assess whether Er81 function is also nec- 
essary for the maintenance of delay-type FS in- 
terneurons in the mature cortex, we generated 
conditional Er81 mutant mice using PV-Cre mice, 
which only achieve effective recombination in the 
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Fig. 1. ErSl expression distinguishes PV^ basket cell n subtypes. (A) Er81 protein expression in the 
somatosensory cortex. (B) PVVErSl^ intemeurons are particularly abundant in cortical layer ll-lll, with a 
superficial to deep gradient within this layer (n = 11 P25 mice). (C and D) Representative traces of firing at 
near threshold potential for Er81^ (red) and Er81“ (gray) PV^ intemeurons. NB, Neurobiotin. (E) Mean 
latency to the first spike (n = 22 and 9 cells, respectively: P < 0.001). (F) Individual factor maps from 
principal component analysis performed on recorded cells reveals a significant segregation between Er8r and 
Er8r PV^ intemeurons. (G) Phase plane plots for Er8r and Er81“ PV^ intemeurons. (H) Threshold potential 
for spike is significantly different between Er8r and Er81“ PV^ intemeurons (n = 21 and 11 cells, respectively: 
P < 0.05). (I and K) Schematic of mEPSCs and mIPSCs recordings in PV^ intemeurons. PC, pyramidal cell: 
PV, PV^ interneuron: IN, Interneuron. (M and N) Representative traces. (J and L) Measurements of mEPSC 
(n = 15 and 6 cells, respectively: P < 0.05) and mIPSC {n = 7 and 4 cells, respectively: P < 0.05) frequencies in 
PV^ intemeurons. Scale bars, 100 |im (A) and 10 |im (C) and (D). Graphs represent means ± SEM. 



cortex in young adult mice {20). As in Lhx6-Cre; 
Er8f^;RCE mice, we did not observe differences 
in the density or distribution of PW intemeurons 
in the cortex of PV-Cre;Er8f^;RCE mice com- 
pared with controls at P60 (Fig. 2C and fig. 
S6A). However, the postnatal elimination of 
Er81 also caused a prominent decrease in the 
mean latency to the first spike in FS intemeurons 
from the cortex of PV-Cre;Er8f^;RCE mice (Fig. 
2D and fig. S6, B to D). Because background con- 
ductance inputs may influence the relative delay 
in firing of the first spike, we analyzed this pa- 
rameter while blocking synaptic activity. We 
did not observe changes in the mean latency to 
the first spike at threshold potential in Er81 
mutant intemeurons with or without synaptic 
blockers (69.7 + 19.1 ms and 60.5 + 16.7 ms, 
respectively; n = 5, Student’s t test, P = 0.6), 
which ruled out non-cell autonomous effects 
on these parameters. 

The ability of some FS intemeurons to ex- 
hibit a prominent delay of the first spike near 
spike threshold potential is thought to serve as 
an important gating mechanism of inhibitory 
function that is mediated by Kvl.l-containing 
potassium channels localized to the axon initial 
segment {21). ErSl could therefore control de- 
layed firing in FS intemeurons by regulating 
the expression of Kvl.l channels. We found a 
prominent decrease in Kvl.l protein levels in 
cortical lysates obtained from PV-Cre;Er81^^ ; 
RCE mice compared with controls (Fig. 2E). To 
confirm that ErSl transcriptionally controls the 
expression of Kvl.l in PV^ intemeurons, we 
isolated these cells from the cortex of control 
and PV-Cre;Er8f^;RCE mice using fluorescence- 
activated cell sorting (FACS) and carried out 
quantitative polymerase chain reaction (PCR) 
analyses. We found significantly lower Kvl.l 
mRNA levels in mutant FS intemeurons than 
in controls (Fig. 2F). Finally, we identified ETS 
binding sequences upstream of the transcrip- 
tional initiation site of Kvl.l and demonstrated 
specific binding of ErSl using chromatin immu- 
noprecipitation (ChIP) analysis in cortical lysates 
(Fig. 2G). Although this later experiment lacks 
cellular resolution, the results are consistent 
with the notion that ErSl regulates the intrinsic 
neural excitability of delay-type FS intemeurons, 
at least in part by controlling the expression of 
Kvl.l channels. 

Our initial observation that delay-type and 
nondelay-type FS intemeurons receive, on aver- 
age, different number of excitatory synapses 
(Fig. 1, H to K) prompted us to examine the con- 
nectivity of cortical PV^ intemeurons in condi- 
tional F'rSi mutants. To this end, we first measured 
synaptic activity on FS intemeurons in acute 
slices obtained from control and conditional 
Lhx6-Cre;Er81^^ ;RCE mice. We found a signif- 
icant decrease in the frequency and amplitude 
of mEPSCs in mutant FS intemeurons compared 
with control cells (Fig. 3, A to E), which suggests 
that loss of ErSl during development changes 
the inputs received by these cells. To test whether 
the late removal of ErSl has the same effect, we 
recorded synaptic activity from FS intemeurons 
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in control andPV-Cre;Er8f'^;RCE mice. We found 
a significant decrease in the frequency of mEPSCs 
on mutant FS intemeurons compared with con- 



trols, indicative of a reduced number of excita- 
tory synapses, but no changes in their amplitude 
(Fig. 3, A to F). We also found that the density of 



vesicular glutamate transporter type 1 (VGlutl^) 
terminals contacting the soma of FS interneu- 
rons lacking Fr81 is also reduced compared with 



Fig. 2. Er81 regulates intrinsic properties of PV^ 
interneurons and Kvl.l expression. (A) Normal 
distribution of PV^ interneurons in P30 layer ll-lll 
after embryonic genetic deletion of Er81. (B) Latency 
to the first spike and threshold potential for spike are 
significantly reduced in Lhx6-Cre;Er81 conditional 
mutants (n = 20 and 18 cells, respectively: P < 0.05 
for both values). (C) Normal distribution of PV^ inter- 
neurons in P60 layer ll-lll after postnatal genetic 
deletion of Er81. (D) Latency to the first spike and 
threshold potential for spike is significantly reduced 
in PV-Cre;Er81 conditional mutants (n = 29 and 
20 cells, respectively: P < 0.001 and P < 0.05, re- 
spectively). (E) Representative immunoblots (left) 
and quantification (right) of Kvl.l protein in the 
cortex of control and conditional mutant mice (n = 5, 
P< 0.01). (F) Kvl.l mRNA levels in PV^ interneurons 
isolated through FACS in control and conditional 
mutant mice (n = 4, P < 0.01). (G) ChIP assays on 
cortical tissue showing ErSl binding to Kvl.l pro- 
moter regions (n = 3: P < 0.05 for site 2). Scale bar, 
100 |im (A) and (C). Graphs represent means ± SEM. 
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Fig. 3. Reduced excitatory inputs onto PV interneurons in conditional 
ErSl mutants. (A) Schematic of mEPSCs recordings in PV^ interneurons. 
(B) Representative traces. (C and D) Measurement and cumulative plot of 
mEPSC frequencies in Lhx6-Cre;Er8f^^ (n = 11 and 9 cells, respectively: P < 
0.05) and PV-Cre;Er8f^^ (n = 17 and 10 cells, respectively: P < 0.01) mutants. 
(E) Measurement of mEPSC amplitudes in Lhx6-Cre;Er8f^^ (n = 11 and 9 
cells, respectively: P < 0.05) and PV-Cre;Er8f^^ mutants (n = 17 and 10 cells, 
respectively: P = 0.06). (F) Experimental design for the labeling of sparse con- 



trol and Er81 mutant MGE-derived interneurons. (G) Labeled interneurons 
are sparsely distributed throughout the cerebral cortex at very low density. 
(H and I) Representative images of layer ll-lll PV^ interneurons after viral 
infection in the MCE. (J) Representative traces of mEPSCs from control and 
Er81 mutant interneurons. (K) Cumulative plot of mEPSCs frequencies (n = 10 
and 11 cells, respectively: P = 0.45). (L) Measurement of mEPSC amplitudes (n = 
10 and 11 cells, respectively: P < 0.05). Scale bars, 10 gm and 5 |im (G) and (H) 
and 100 |j,m (I). Graphs represent means ± SEM. 
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controls (fig. S7, A and B), whereas the proba- 
bility of release of these terminals is comparable 
for controls and mutants (fig. S7C). By contrast, 
recordings of mIPSCs from FS interneurons re- 
vealed a strong increase in their frequency with- 
out changes in amplitude in PV-Cre;ErSf^;RCE 
mice compared with controls (fig. S8, A to D). 
This increase likely reveals an increase in the num- 
ber of presynaptic inhibitory contacts received by 
FS interneurons in the absence of Er81, because 
no changes were observed in y-aminobutyric 
acid (GABA) release probability (fig. S8E). 

The observed changes in the inputs received 
by FS interneurons in ErSl conditional mutants 



have two possible explanations. One possibility 
is that ErSl directly regulates the connectivity 
of delay-type FS interneurons, as has been sug- 
gested for primary sensory neurons {22). Alter- 
natively, the changes observed in conditional 
ErSl mutants may reflect homeostatic adapta- 
tions in response to the loss of delay-type FS 
interneurons from cortical networks. To dis- 
tinguish between these two possibilities, we 
ablated ErSl from sparse cortical FS interneu- 
rons in each mouse. We used ultrasound im- 
aging to inject low-titer retroviruses encoding 
Cre and Gfp into the MGE of wild-type and 
ErSl^^ embryos on embryonic day 14.5 (E14.5) 



(Fig. 3F), when FS interneurons are being gen- 
erated. These experiments labeled very few in- 
terneurons that migrate to layer II-III in both 
genotypes and that lack ErSl expression only in 
ErSf^^ mice (Fig. 3, G to I). We then performed 
whole-cell recordings from layer II GFP^ cells in 
P30 acute slices obtained from these mice. We 
found an almost complete loss of delay-type FS 
interneurons in the absence of ErSl, as evidenced 
by a prominent reduction in the mean latency to 
the first spike and the levels of Kvl.l expression 
in FS intemeurons lacking ErSl compared with 
control cells (fig. S9, A to E). In contrast to these 
changes in intrinsic properties, we did not find 





Fig. 4. Activity-dependent changes in ErSl expression. (A) Expression of 
ErSl and beta-galactosidase (pGal) in layer II-III PV^ interneurons. (B and C) 
Representative examples and relative proportion of four groups of layer II-III 
PV^ interneurons based on ErSl and pGal expression. (D) Most PV^ inter- 
neurons contain significant ErSl mRNA levels. (E) Scatter plot of ErSl expression 
and latency to the first spike. Line, linear regression fit {R^) = 0.63 (P < 0.0001). 
(F) Representative traces of recorded PV^ interneurons illustrating the positive 
correlation between delayed firing and ErSl expression. (G) Learning curve 
(RotaRod performance) for motor task. (H) ErSl fluorescence intensity levels 
in PV^ interneurons from control and trained animals (n = S; P < 0.05). (I) Dis- 



tribution of ErSl fluorescence intensities in layer II PV^ interneurons from 
control and trained mice (n = S; P < 0.001). Gells were grouped into three main 
classes on the basis of their levels of fluorescence intensity for nuclear ErSl, < 
1.3 times background levels (-), 1.3 to 1.6 times background (-t-), and > 1.6 times 
background levels (-r-r). (J) Experimental design and distribution of elec- 
troporated pyramidal cells in layer II-III. (K) Mean ErSl fluorescence intensity in 
PV^ interneurons from control and K'/r2.J-electroporated mice (n = 4; P < 0.01). 
(L) Distribution of ErSl fluorescence intensities in layer III PV^ interneurons from 
control and K'/r2.J-electroporated mice (n = 4; P < 0.001), groups as in (I). Scale 
bars, 50 |j,m (A), 5 |j,m (B), and 100 [im (J). Graphs represent means ± SEM. 
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differences in the frequency of mEPSCs (Fig. 3, J 
to L) or the density of VGlutl^ terminals con- 
tacting the soma of FS interneurons lacking Er81 
compared with controls cells (fig. S9, F and G). 

Our results demonstrate that Er81 generates 
cell diversity within FS basket cells in the mouse 
cortex. Because Er81 function is also required for 
the maintenance of delay-type FS intemeurons 
in the adult cortex (as revealed by the analysis 
of PV-Cre;ErSf^;RCE mice), we next wondered 
when Er81 expression begins to distinguish both 
populations of interneurons. We used mice in 
which nLacZ have been inserted in the coding 
region of Er 81 (22) and analyzed 

the fraction of FS interneurons expressing 
P-galactosidase in layer II-III of the somato- 
sensory cortex. The large majority of layer II-III 
PV^ interneurons contained p-galactosidase in 
the adult cortex, including many cells with un- 
detectable levels of nuclear Er81 protein (Fig. 4, 
A to C). This surprising result suggests that al- 
most all FS interneurons transcribe Er81 mRNA. 
Alternatively, because p-galactosidase is known 
to perdure long after being synthesized (23), 
this result may indicate that the majority of FS 
intemeurons may have transcribed at some 
point during development. To distinguish between 
these two possibilities, we performed whole-cell 
patch-clamp recordings in PW intemeurons from 
layers II-III of the adult somatosensory cortex 
and subsequently collected the cytoplasm for 
single-cell reverse transcriptase PCR of Er81. 
Compared with Er81 mutant intemeurons (used 
to calibrate background), nearly all FS basket 
cells in the adult layer II-III of the somato- 
sensory cortex contained substantial and com- 
parable levels of Er81 mRNA (10 out of 11 cells) 
(Fig. 4D), which suggests that the expression of 
Er81 in these intemeurons is regulated post- 
transcriptionally. In addition, we found a direct 
correlation between the levels of nuclear Er81 
protein and the latency to the first spike at near- 
threshold potential in FS intemeurons (Fig. 4, 
E and F). 

These observations prompted us to investigate 
whether the relative proportion of delay-type 
and nondelay-type FS intemeurons remains 
constant in the adult cortex. Er81 transcription 
and activity is modulated by activity-dependent 
changes in intracellular calcium (24, 25), which 
raises the possibility that the complement of 
layer II-III FS interneuron subtypes might be 
adjusted in response to variations in network 
activity (26, 27). To test this hypothesis, we phar- 
macologically modified network activity levels 
in acute cortical slices. KCl or kainate rapidly 
decreased the relative proportion of layer II-III 
Er81^ FS intemeurons within 2 hours (fig. SIO, 
A and B). This reduction in Er81^ intemeurons 
was associated with a parallel decrease in the 
mean latency to the first spike (fig. SIOC). By 
contrast, pharmacological reduction of network 
activity with the L-type calcium channel blocker 
Nifedipine led to a significant rise in the pro- 
portion of Er81^ FS intemeurons and a con- 
comitant increase in the mean latency to the 
first spike of these cells (fig. SIO, D to F). 
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To characterize the activity-dependent regu- 
lation of Er81 expression in more physiological 
conditions, we trained mice in a motor-learning 
paradigm and analyzed the levels of Er81 ex- 
pression in layer II of the motor cortex (Fig. 4G). 
Motor learning significantly decreased the pro- 
portion of Er81^ FS intemeurons compared with 
controls, because of a shift in Er81 levels toward 
lower intensity values (Fig. 4, H and I). To de- 
crease the overall levels of activity in layer II-III 
pyramidal cells, we introduced the inward-rectifier 
potassium ion channel Kir2.1 in their cortical pro- 
genitor cells by means of in utero electroporation 
(28, 29) and examined the levels of Er81 protein 
in layer III of the somatosensory cortex in adult 
mice (Fig. 4J). We found a prominent increase in 
the proportion of Er81^ FS intemeurons com- 
pared with controls, due to a change in Er81 levels 
toward higher-intensity values (Fig. 4, K and L). 
It was interesting that we did not observe com- 
parable changes in other layers of the cortex, 
which suggests that this type of cellular plastic- 
ity mediated by Er81 might be limited to cortical 
layer II-III (fig. Sll). 

Our findings provide a mechanistic explana- 
tion for the activity-dependent regulation of 
interneuron properties through the transcrip- 
tional control of neuronal excitability and the 
modulation of synaptic inputs. Er81 has been 
previously shown to regulate the terminal dif- 
ferentiation of dopaminergic neurons and cer- 
ebellar granule cells (30, 31). In the cortex, Er81 
shapes information processing by modulating 
the intrinsic properties of FS intemeurons 
through the regulation of the Kvl.l channel sub- 
unit, the expression of which is necessary for 
delayed firing (21). Other factors are likely in- 
volved in this process, because loss of Er81 does 
not completely transform fast-spiking interneu- 
rons into nondelayed cells. Kvl.l function is 
also regulated by ErbB4 (32), a tyrosine kinase 
receptor that is highly expressed in the post- 
synaptic density of glutamatergic synapses con- 
tacting FS basket cells (33). This delineates a 
possible pathway through which synaptic acti- 
vity may control Er81 expression to modulate 
the intrinsic properties of FS basket cells. 

Our results also support the notion that activity- 
dependent mechanisms play a prominent role in 
the specification of neuronal properties (34, 35). 
However, in contrast to the classical view in 
which the “specification” typically refers to the 
process by which cells achieve and maintain a 
stable fate independent of environment, our re- 
sults suggest that layer II-III fast spiking basket 
cells may not exist as distinct classes of intemeu- 
rons, but rather as a continuum of intemeurons 
with “tunable” characteristics that adapt to the 
changing environment. The observation that cor- 
tical circuits reversibly adapt to changing levels 
of activity by tuning the efficacy of delayed firing 
within the population of FS basket cells suggests 
a novel mechanism of network plasticity, and re- 
inforces the notion that interneuron function is 
ultimately context dependent (5). 

Recent work has linked different configura- 
tions of FS basket cells, as revealed by distinct 



levels of PV expression, with learning and me- 
mory (7). Our analysis of Er81 conditional mutant 
mice indicate that drastic imbalances in network 
configuration are accompanied by population- 
driven changes in structural synaptic plasticity, 
through which FS basket cells may lose their 
ability to filter out relatively weak, asynchronous 
stimulus. 
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GLOBAL CARBON CYCLE 

The reinvigoration of the Southern 
Ocean carhon sink 
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Several studies have suggested that the carbon sink in the Southern Ocean— the ocean’s 
strongest region for the uptake of anthropogenic CO 2 — has weakened in recent decades. 
We demonstrated, on the basis of multidecadal analyses of surface ocean CO 2 
observations, that this weakening trend stopped around 2002, and by 2012 the Southern 
Ocean had regained its expected strength based on the growth of atmospheric CO 2 . All 
three Southern Ocean sectors have contributed to this reinvigoration of the carbon sink, 
yet differences in the processes between sectors exist, related to a tendency toward a 
zonally more asymmetric atmospheric circulation. The large decadal variations in the 
Southern Ocean carbon sink suggest a rather dynamic ocean carbon cycle that varies more 
in time than previously recognized. 



S imulations with ocean biogeochemical mod- 
els have suggested a stagnation or even a 
reduction of the Southern Ocean carbon 
sink from the 1980s to the early 2000s (1-3), 
a result that has been supported by inver- 
sion studies (1) based on atmospheric CO 2 data. 
Such a stagnation has wide-reaching implications 
for climate, because the Southern Ocean south of 
35°S accounts for about 40% of the global oce- 
anic uptake of anthropogenic CO 2 (4-6), thereby 
removing a disproportionally large share of an- 
thropogenic CO 2 from the atmosphere. The trend 
toward a saturation of the Southern Ocean car- 
bon sink has been attributed mainly to the in- 
tensification and poleward shift of the westerly 
winds associated with a trend toward a more pos- 
itive state of the Southern Annular Mode (1, 2). 
The resulting enhanced upwelling of deep waters 
with high concentrations of dissolved inorganic 
carbon (DIG) drove an anomalously strong flux of 
natural CO 2 out of the surface ocean, counter- 
acting the increase in the oceanic uptake of an- 
thropogenic CO 2 (2). 
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Several studies based on observations of the 
surface partial pressure of CO 2 (PCO 2 ) (7-9) cor- 
roborated these model-based trends in the South- 
ern Ocean carbon sink, but all of them used the 
observations without any interpolation. Given the 
sparsity and spatial heterogeneity of these surface 
ocean observations (8), the conclusions drawn in 
these studies regarding the trends turn out to be 
rather sensitive to the chosen method of trend 
calculation (9) and the beginning and end year of 
analysis (10). Nevertheless, these studies tended to 
support a weakening sink trend up to the mid- 
2000s. One of these studies (9) also pointed out 
that the trend may have reversed in recent years, 
a finding corroborated by the analysis of PCO 2 
observations along a single meridional transect 
south of Tasmania (11). 



To address the sparse data coverage, we used a 
neural network technique (12) to interpolate the 
PCO 2 observations in time and space. We then 
evaluated the results using (i) a complementary 
PCO 2 observation-based product based on the in- 
terpolation by a data-driven mixed-layer scheme 

(13) and (ii) an atmospheric CO 2 inverse estimate 

(14) . Both surface ocean-based methods were ex- 
tended for this study to produce multidecadal dis- 
tributions of the surface ocean PCO 2 field (15-17). 
The air-sea CO 2 flux variations were then com- 
puted with the use of a standard bulk param- 
eterization (supplementary text 1.4). Although 
each of these estimates faces limits due to the 
available information, their combination allows 
us to gain confidence in the inferred features. 

The two surface ocean data-based air-sea CO 2 
flux products confirm that the Southern Ocean 
carbon sink (south of 35°S) weakened through 
much of the 1990s, in agreement with the model- 
based studies and the atmospheric inversions 
(1, 2), but they reveal that it has strengthened 
substantially since about 2002, increasing by 
more than -0.6 Pg of C year“^ (Fig. 1) to a vig- 
orous uptake of -1.2 Pg of C year“^ in 2011. This 
increase has returned the Southern Ocean sink 
to levels expected from the increase in atmo- 
spheric CO 2 (5), computed from an ocean bio- 
geochemistry model forced with just the increase 
in atmospheric CO 2 (18). The increase in the 
Southern Ocean carbon uptake since 2002 is re- 
sponsible for roughly half of the global trend in 
the ocean carbon sink over this period (15), high- 
lighting the importance of the Southern Ocean 
in moderating the growth of atmospheric CO 2 . 

Both surface ocean observation-based methods 
rely on the to-date largest sea surface PCO 2 ob- 
servation database [Surface Ocean CO 2 Atlas 
(SOCAT) version 2] (19), which contains more 
than 2.6 million observations in the Southern 
Ocean south of 35°S. The neural network tech- 
nique (12, 15) interpolates these observations 




Fig. 1. Evolution of the Southern Ocean carbon sink anomaly south of 35°S. The lines show the inte- 
grated air-sea CO 2 flux derived from two complementary surface ocean PCO 2 interpolation methods [a 
two-step neural network technique ( 15 ) and a mixed-layer scheme ( 17 )] as well as the integrated flux 
from an atmospheric inversion based on measurements of atmospheric CO 2 ( 14 ). These estimates are 
compared with the expected uptake based on the growth of atmospheric CO 2 alone, based on a 
simulation with the ocean component of the Community Climate System Model (CCSM3) ( 18 ). All data 
are plotted as anomalies by subtracting the 1980s mean flux from each method. Negative values 
indicate anomalous uptake by the ocean. 
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to a 1° X 1° global grid at a monthly resolution for 
the period from 1982 through 2011, resulting in a 
multidecadal reconstruction of the global ocean 
carbon sink. The method relies on nonlinear but 
robust relations between the limited PCO 2 ob- 
servations and the properties that are measured 
more frequently, such as sea surface temperature, 
sea surface salinity, satellite measurements of 
chlorophyll a, and mixed-layer depth (supplemen- 
tary text 1.2). The mixed-layer scheme (13, 17) (ver- 
sion oc_vl.2), in contrast, does not regress PCO 2 
variations to physical, chemical, or biological driver 
data but directly assimilates the available PCO 2 ob- 
servations into a mass budget of the mixed layer at 
a resolution of 4° x 5° in space and daily in time. 
This method also uses several ancillary observa- 
tions to parameterize the air-sea CO 2 exchange, 
solubility, and carbon chemistry but does not use 
them to interpolate the PCO 2 to regions without 
observations. Instead, it interpolates the PCO 2 
data directly. 

Extensive validation of the neural network- 
based estimate using independent observations 
reveals that the method is able to map the sparse 
PCO 2 data with little bias (with mean differences 
between SOCAT observations and neural network 
estimates of generally less than 2 patm; table SI) 
in space and time. Both methods agree well re- 
garding the sign and the magnitude of the dec- 
adal trends within the two decades from 1992 
through 2001 and 2002 through 2011 (Fig. 1 and 
table S2), when the majority of surface ocean 
PCO 2 observations were made (fig. S3). 

However, given the methodological differences 
in the data treatment in data-sparse regions (in- 
terpolation versus regression), there is less agree- 
ment regarding higher-frequency variability such 
as the year-to-year variations in the sink strength. 
This lower agreement is a result of the weaker 
signal-to-noise ratio of the PCO 2 data in the in- 
terannual frequency band. Under such conditions, 
the direct interpolation scheme of the mixed-layer 
method tends to extrapolate high-frequency noise 
present in the observations to the data-sparse 
regions, probably generating overly strong var- 
iations there. In contrast, the neural network 
scheme suppresses the high-frequency noise by 
being constrained by the ancillary observations, 
resulting in a possible underestimation of the 
year-to-year variability in the data-sparse regions. 
In contrast, the relatively strong PCO 2 signals 
that underlie the decadal changes in the Southern 
Ocean are distinctly captured by the two methods, 
resulting in very similar decadal trends. 

The changes in the Southern Ocean carbon 
sink are almost entirely driven by changes in 
the air-sea difference of PCO 2 , i.e., ApC 02 = 
pC07^^ - pCOf^ (pCOf^ = atmospheric PCO 2 ), 
because the direct effect of changes in the wind 
and temperature on the gas transfer coefficient is 
small (fig. S8). The spatial pattern of the trends 
in ApC 02 from the neural network method re- 
veals for both decades a very uniform trend pat- 
tern across the entire Southern Ocean, with the 
strongest ApC02 trends at high latitudes (Fig. 2, 
A and B). The spatial trends for the mixed-layer 
scheme are similar, although at coarser resolu- 



tion and with somewhat more zonal variations, 
part of which may be spurious because of miss- 
ing data constraints there, reflecting the more 
variance-producing nature of this method in 
data-sparse regions (fig. S6). From 1992 through 
2001, the trend in ApC 02 was strongly positive, 
driven by the surface ocean PCO 2 increasing at 
nearly twice the rate of around Antarc- 

tica. In contrast, from 2002 onward, the growth 
of surface ocean PCO 2 nearly stalled, strongly in- 
creasing the degree of surface ocean undersatura- 
tion, which ultimately drove the increasing uptake 
of atmospheric CO 2 . 

We tested the robustness of this result on the 
basis that such strong decadal changes in the CO 2 



uptake across most of the Southern Ocean should 
leave an imprint on atmospheric CO 2 , taking 
advantage of the lack of land regions with sub- 
stantial CO 2 fluxes south of 35°S. Specifically, we 
used an atmospheric inversion method (14) to 
infer the air-sea CO 2 fluxes that are optimally 
consistent with the atmospheric CO 2 data, while 
taking into consideration atmospheric transport 
and mixing. The setup of the employed version 
s85_v3.7 uses the atmospheric winds from the 
ERA-Interim reanalysis (20). The evolution of 
the Southern Ocean carbon sink from this in- 
version of atmospheric CO 2 data also supports 
our postulated larger-than-expected increase 
in the strength of the Southern Ocean carbon 
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Fig. 2. Trends in ApC02 based on the neural network output and its two components for the two 
analysis decades: 1992 through 2001 and 2002 through 2011. (A) Linear trend in ApCOz for the 
1990s. (B) As (A) but for the 2000s. Linear trend in (C) thermal pCOz and (D) nonthermal ApCOz for the 
1990s. (E) and (F), as (C) and (D) but for the 2000s. Positive (red) ApCOz trends indicate a faster increase 
of pCOz in the surface ocean than in the atmosphere (i.e., a decreasing sink) and vice versa for positive 
(blue) trends. Hatched areas indicate where the linear trends are outside the 5% significance level (P > 
0.05). (G) and (H) illustrate decadal trends of sea level pressure (shading) and 10-m wind (vectors) from 
1992 through 2001 (G) and 2002 through 2011 (H) based on data from the ERA-Interim reanalysis (20). 
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sink within the past decade (Fig. 1), even though 
it shows much less of a weakening during the 
1990s. Thus, two complementary PCO 2 database 
estimates, as well as an atmospheric CO 2 inver- 
sion, confirm that the Southern Ocean carbon 
sink has experienced a significant strengthening 
since the early 2000s. 

This reinvigoration after the early 2000s can- 
not be a simple reversal of the Southern Annular 
Mode-driven wind trend that has been suggested 
to cause the weakening of the Southern Ocean 
carbon sink over the past decades (1, 2), because 
the ERA-Interim reanalysis winds (20) do not 
show such a signal (Fig. 2, G and H). Instead, the 
atmospheric circulation became more zonally 
asymmetric with a wavenumber two pattern, rem- 
iniscent of the lower-frequency pattern of var- 
iability of the Antarctic Qrcumpolar Wave (21). But 
how can this zonally asymmetric forcing result in a 
relatively zonally uniform response of the surface 
ocean PCO 2 ? 

Insight into the drivers is gained by separating 
the ApC 02 trend pattern into a component driven 
by changes in sea surface temperature [the thermal 
trend (Fig. 2, C and E)] and one driven by changes 
in the DIG and/or alkalinity [the nonthermal trend 
(Fig. 2, D and F)] (22). For both analysis periods, 
the trends in the thermal and nonthermal com- 
ponents are generally opposed for any given 
location, which is in line with previous studies 
(22-24). The thermal component shows a sink 
increase in both decades in the Pacific sector, 
where the advection of cold air from Antarctica 
and sea-ice changes led to a persistent surface 
cooling trend (25). In the lower latitudes of the 



Atlantic and Indian sectors, we find a reduced 
thermally driven uptake in the 2000s due to 
surface ocean warming, which is probably related 
to the more asymmetric atmospheric circulation 
that caused a reduced northward Ekman transport 
(fig. Sll) of cold polar waters in these regions. 

In the nonthermal component, we find more 
distinct differences between the two periods. Be- 
tween 1992 and 2001, the nonthermal component 
increased the oceanic PCO 2 over most of the 
Southern Ocean (Fig. 2D), in particular in the 
high latitudes and in the Pacific sector. The es- 
timated changes in Ekman pumping velocity [es- 
timated from ERA-Interim winds (20), fig. Sll] 
support the hypothesis that wind changes led to 
an increased surface divergence and an associated 
upwelling of DIC-rich waters into all sectors of the 
high-latitude Southern Ocean (2) in the first period. 
During the subsequent period, the nonthermal 
component primarily reduced the oceanic PCO 2 
in the Atlantic and Indian sectors and over the 
Antarctic shelf (Fig. 2F). In contrast, this compo- 
nent continued to increase PCO 2 in most of the 
Pacific sector, although at a much weaker rate than 
in the 1990s. This much weaker DIG- and/or 
alkalinity-induced increase inpG02 in the Pacific 
sector could no longer compensate for the ther- 
mal trend, so that the negative trend in the total 
PCO 2 in this region for the period after 2001 is 
dominated by the thermal trend. In contrast, in 
the Atlantic and Indian sectors and over the Ant- 
arctic shelf, the negative nonthermal trend dom- 
inates the thermal changes. Thus, overall, the 
temperature-dominated PCO 2 trend in the Pacific 
sector and the DIG/alkalinity-driven trend in the 



other regions have worked in tandem to prevent 
PCO 2 from increasing across the entire Southern 
Ocean since the early 2000s. Over the same pe- 
riod, atmospheric GO 2 continued to rise strongly, 
resulting in a substantial increase in the under- 
saturation of the surface ocean with regard to at- 
mospheric GO 2 , hence driving a strong increase in 
the flux of GO 2 into the entire Southern Ocean. 

We interpret this zonal asymmetry of thermally 
and DIG/alkalinity-driven changes to be primar- 
ily the result of an increased asymmetry in the 
Southern Hemisphere atmospheric circulation 
in the years since 2001 (Fig. 3). Specifically, the 
conditions became more cyclonically dominant 
in the Pacific sector, and more anti-cyclonically 
dominant in the Atlantic and parts of the Indian 
sector (Fig. 2H). As a result of the associated in- 
crease in the meridional wind components, more 
cold air was advected from the Antarctic conti- 
nent over the Pacific sector, and more warm air 
was advected from subtropical latitudes over the 
Atlantic and part of the Indian sectors. Together 
with the changes in northward Ekman transport 
(fig. Sll), this provides an explanation for the 
strengthened asymmetry in the sea surface tem- 
perature trends, which underlie the thermal trends 
in Fig. 2E. 

A strengthening of the carbon sink in the Pacific 
sector, combined with the further intensification 
of the winds (Fig. 2H) during the 2000s, pro- 
vide a paradox at first sight, because the in- 
creased upwelling in the Pacific sector should 
have increased the surface DIG content further. 
A possible explanation is that the recent stabi- 
lization of the surface waters (26) counteracted 
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Fig. 3. Schematic of the processes 
governing the changes in the ApCOz 
trends in the Southern Ocean since 
2001. The trend toward a zonally more 
asymmetric distribution of the atmo- 
spheric pressure systems in the past 
decade led to stronger meridional 
winds bringing either colder air (Pacific 
sector) or warmer air (Atlantic sector) 
to the open Southern Ocean, causing 
strong cooling of the sea surface in the 
Pacific sector and warming in the 
Atlantic sector. The changes in wind 
also affected the oceanic circulation 
pattern, with the net effect being an 
increase in the DIG/alkalinity-driven 
pCOz component in the Pacific sector 
and a decrease of this component in 
the Atlantic sector (i.e., opposing the 
effect of sea surface temperature on 
pCOz). In the Pacific sector, the effect 
of the cooling trend on PCO2 prevails, 
whereas in the Atlantic sector, the 
effect of circulation/mixing on DIG/ 
alkalinity prevails, also causing a low- 
ering trend in pCOz- Thus, owing to the 
interaction between temperature and 
circulation changes, the zonally 
asymmetric forcing caused a zonally 
relatively symmetric response of the 
ocean carbon sink. 
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the wind-induced upwelling. In the Pacific sector 
and in coastal regions, strong surface freshening 
(26, 27) might have caused most of this stabili- 
zation, whereas in the lower latitudes of the At- 
lantic and Indian sectors, warming stabilized the 
surface waters. The reduction in northward Ekman 
transport to the lower-latitude Atlantic and Indian 
sectors during the 2000s (fig. Sll), which is pro- 
bably the result of the zonally more asymmetric 
atmospheric circulation, also reduced the north- 
ward advection of high-latitude waters, lowering 
the Die content and/or increasing the alkalinity 
at the surface. 

The trend toward a zonally more asymmetric 
atmospheric circulation may be related to long- 
term variations of the tropical sea surface tem- 
perature; i.e., to the more prevalent La Nina 
conditions in the Pacific since the early 2000s (28) 
and the more positive phase of the Atlantic Multi- 
decadal Oscillation over recent decades (29). Alter- 
natively, it may be driven by a zonally asymmetric 
response of the Southern Hemisphere near-surface 
circulation to the anthropogenic forcing (25). 

Our results indicate that Earth’s most impor- 
tant sink for anthropogenic CO 2 (5, 6) is more 
variable than previously suggested and that it re- 
sponds quite sensitively to physical climate varia- 
bility. This also suggests that should current climate 
trends reverse in the near future, the Southern 
Ocean might lose its recently regained uptake 
strength, leading to a faster accumulation of CO 2 
in the atmosphere and consequently an acceler- 
ation of the rate of global warming. 
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PLANT SCIENCE 

Six enzymes from mayapple that 
complete the biosynthetic pathway to 
the etoposide aglycone 

Warren Lau and Elizabeth S. Sattely* 

Podophyllotoxin is the natural product precursor of the chemotherapeutic etoposide, yet 
only part of its biosynthetic pathway is known. We used transcriptome mining in 
Podophyllum hexandrum (mayapple) to identify biosynthetic genes in the podophyllotoxin 
pathway. We selected 29 candidate genes to combinatorially express in Nicotiana 
benthamiana (tobacco) and identified six pathway enzymes, including an oxoglutarate- 
dependent dioxygenase that closes the core cyclohexane ring of the aryltetralin scaffold. 
By coexpressing 10 genes in tobacco— these 6 plus 4 previously discovered— we 
reconstitute the pathway to (-)-4'-desmethylepipodophyllotoxin (the etoposide aglycone), 
a naturally occurring lignan that is the immediate precursor of etoposide and, unlike 
podophyllotoxin, a potent topoisomerase inhibitor. Our results enable production of the 
etoposide aglycone in tobacco and circumvent the need for cultivation of mayapple and 
semisynthetic epimerization and demethylation of podophyllotoxin. 



A lthough numerous clinically used drugs de- 
rive from plant natural products, little is 
known about their biosynthetic genes, 
which prevents access to engineered hosts 
for their production (1). Very few complete 
pathways exist, and only three— artemisinic acid 
(2), the benzylisoquinoline alkaloids (3, 4), and 
the monoterpenoid indole alkaloids (5, d)— have 
been transferred to a heterologous host for cur- 
rent or future industrial production. Knowledge 
of plant pathways is especially stark in compar- 



ison with the >700 bacterial and fungal biosyn- 
thetic pathways that have been characterized (7). 

Podophyllotoxin, a lignan from mayapple, is the 
natural product precursor to the topoisomerase 
inhibitor etoposide (8-10), which is used in dozens 
of chemotherapy regimens for a variety of malig- 
nancies. Although etoposide is on the World Health 
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Organization’s list of essential medicines, pro- 
duction requires isolation of (-)-podophyllotoxin 
from the medicinal plant Podophyllum {11). Sub- 
sequent semisynthetic steps to produce etoposide 
are required for topoisomerase inhibitory acti- 
vity not present in podophyllotoxin. A complete 
biosynthetic route would enable more facile ac- 
cess to etoposide and natural and unnatural de- 
rivatives that are difficult to produce synthetically 
{12). Early steps of podophyllotoxin biosynthesis 
{13-16) involve the unusual enantio- and site- 
selective dimerization of coniferyl alcohol to form 
(+)-pinoresinol and provide a starting point for 
identifying additional biosynthetic genes (Fig. 1). 
However, biosynthetic gene discovery in Podo- 
phyllum is a challenge, because the plant grows 
slowly, the genome is large [~16 Gb {17)'] and 
unsequenced, and methods for constructing mu- 
tants are laborious {18). 

We usedAgro&actenz/m-mediated transient ex- 
pression mN. henthamuma to test candidate genes 
for the podophyllotoxin pathway for two reasons. 
First, this versatile plant host would likely pro- 
duce correctly folded, active proteins from a 
variety of enzyme superfamilies without optimiza- 
tion. Second, we wanted to rapidly and combi- 
natorially express candidate enzymes without 
knowing the order of steps or identities of meta- 
bolic intermediates and without additional clon- 
ing. Combinatorial expression can be accomplished 
by coinfiltrating multiple Agrohacterium strains— 
each harboring a different expression construct— 
and analyzing the resulting plant tissue extracts 
by using untargeted metabolomics to identify 
products. 

In our initial approach to produce the pathway 
intermediate (-)-pluviatolide in N. brnthamiaria 
leaves, we coexpressed three of the four known 
podophyllotoxin biosynthetic enzymes: pinoresinol- 
lariciresinol reductase (PLR), secoisolariciresinol 
dehydrogenase (SDH), and CYP719A23 [dirigent 
protein (DIR) was not required]. Although we 
observed low levels of (-)-pluviatolide in the re- 



sulting leaf extracts, the amount was insufficient 
for detecting downstream intermediates produced 
when coexpressing candidate enzymes (fig. SI). 
No pluviatolide was detected in control experi- 
ments when only green fluorescent protein (GFP) 
is expressed. To enhance (-)-pluviatolide pro- 
duction inplanta, we infiltrated leaves expressing 
CYP719A23 with (-)-matairesinol (isolated from 
Forsythia x mtermedia) 5 days st^erAgrobaoterium 
infiltration. After 1 day, (-)-pluviatolide concentra- 
tions were -75 times those in leaves expressing 
PLR, SDH, and CYP719A23, without substrate in- 
filtration (fig. S2), which provided sufficient (-)- 
pluviatolide to enable candidate enzyme screening. 

To select candidate enzymes for conversion of 
(-)-pluviatolide to the next pathway intermediate, 
we mined the publicly available P. hexandrum 
RNA-sequencing (RNA-Seq) data set from the 
Medicinal Plants Consortium. We noted that all 
known podophyllotoxin genes were highly ex- 
pressed in rhizome, stem, and leaf tissues, and 
we selected candidate genes with similar expres- 
sion profiles (fig. S3). As the order of steps in the 
pathway was not known, we chose four putative 
0-methyltransferases {OMTl-4), 12 cytochromes 
P450 {CYP), and a 2-oxoglutarate/Fe(II)-dependent 
dioxygenase {2-ODD). We infiltrated (->matairesinol 
into leaves each coexpressing CYP719A23 and a 
single candidate enzyme. Liquid chromatography- 
mass spectrometry (LC-MS) analysis revealed the 
consumption of (-)-pluviatolide in tobacco leaves 
coexpressing just one of the candidates, OMT3 
(fig. S4). By computationally comparing untar- 
geted metabolomics data from tissue extracts, we 
identified two compound mass signals unique to 
CYP719A23 + OMT3 samples relative to CYP719A23 
alone: One corresponds to (->5'-desmethoxy-yatein 
(fig. S5); the other, with much lower ion abundance, 
likely derives from the double methylation of 
(-)-matairesinol (fig. S6). Expression of OMT3 
alone, followed by infiltration of (-)-matairesinol, 
results in greater amounts of the doubly methyl- 
ated product, which suggests that this enzyme 



can accept multiple substrates. We recombinant- 
ly expressed OMT3 in Escherichia coli and mea- 
sured its kinetic parameters for (-)-pluviatolide 
methylation [apparent Michaelis constant {K^ = 
1.4 pM and enzymatic rate (A^cat) = 0.72 s"^] (fig. S7). 
OMT3 accepts (-)-matairesinol and (-)-arctigenin 
with much lower efficiency and cannot turn over 
(+)-pinoresinol; these data suggest that OMT3 
catalyzes methylation of pluviatolide to generate 
(-)-5'-desmethoxy-yatein as the next step in the 
pathway. 

We next coexpressed individual candidate CYP 
and 2-ODD enzymes with CYP719A23 and OMT3; 
however, we did not observe consumption of 
(-)-5'-desmethoxy-yatein in leaf extracts, which 
suggested that our set of candidate genes was 
incomplete. We reasoned that additional tran- 
scriptome data from P. hexandrum tissue sam- 
ples with differential expression of pathway 
genes could aid candidate selection. 

The expression of known (-)-podophyilotoxin 
biosynthetic genes is up-regulated in P. hexan- 
drum leaves after wounding {19) (Fig. 2A and fig. 
S8). LC-MS analysis of metabolites in wounded 
leaves (removed from the stem to eliminate the 
possibility of metabolite transport) revealed that 
both (-)-yatein and (-)-deoxypodophyllotoxin [pro- 
posed precursors to (-)-podophyllotoxin {20, 21)] 
accumulate and reach a maximum level 12 to 
24 hours after wounding (fig. S9). Consistent 
with previous reports {16), we did not detect 
(-)-podophyllotoxin or its glucoside in leaf tissues. 

We took advantage of the pathway’s indu- 
cibility and performed RNA-Seq on triplicate 
P. hexandrum leaf samples, 0, 3, 9, and 12 hours 
after wounding, from a single plant with the 
strongest metabolite response. We assembled a 
leaf transcriptome, determined expression levels, 
and used predicted enzyme activities required for 
the missing pathway steps to mine the data for 
gene sequences encoding OMTs, CYPs, 2-ODDs, 
and polyphenol oxidases {PPOs). A computational 
analysis based on expression profile similarity with 
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Fig. 1. Biosynthetic pathway of (-)-podophyllotoxin in P. hexandrum. Uncharacterized steps are indicated by dashed lines. 
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Fig. 2. Expression analysis to identify candidate genes. (A) qRT-PCR analysis of podophyllotoxin biosynthetic genes after P. hexandrum leaf wounding 
(at t = 0 hour). Relative expression levels were normalized to t = 0 hour. Data are average values (three technical replicates) ± one SD. (B) Hierarchical clustering 
of RNA-Seq expression data after filtering by enzyme family and expression level. Heat map depicts the expression levels from a single node from the resulting 
cluster. Color key: Known biosynthetic genes (black! candidate genes (red), genes identified in this report (red with black arrows). 



known pathway genes DIR and CYP719A23 and 
overall expression level yielded seven candidate 
pathway genes: Phex30848 {2-ODD); Phex32688 
(CYP); Phexl3114 {OMTl, previously tested); PhexII59 
{PPO); Phex34339 {PPO); Phex524 {CYP71CU1); 
dx\.dPhjexl5199 {CYP) (fig. SIO to S12). Hierarchical 
clustering analysis of 336 expressed genes, se- 
lected by filtering all data (34,384 total genes; see 
table SI) by enzyme family, revealed a single 
clade of 91 genes; further filtering by expression 
level condensed this clade to 22 genes contain- 
ing six of these seven candidates, three of four 
known pathway genes, and OMT3 (Fig. 2B). 

We individually coexpressed six of these seven 
candidate enzymes (the putative hydroxylases) 
with CYP719A23 and OMT3 in tobacco leaves 
to test for a (-)-5 -desmethoxy-yatein hydroxy- 
lase. We infiltrated leaves with (-)-matairesinol 
4 days after infiltration and harvested a day 
later for LC-MS analysis. In samples coexpress- 
ing Phex524 (CYP71CU1), we observed turnover 
of (-)-5'-desmethoxy-yatein (Fig. 3). A com- 
parison of the leaf metabolomes revealed two 
CYPTlCUl-dependent compound mass signals 
that correspond to the calculated miz of {-)-&- 
desmethyl-yatein [assignment supported by tan- 
dem mass spectrometry (MS/MS)] (fig. S13). The 
earlier eluting mass signal is likely an in-source 
fragmentation ion originating from a glycosylated 
derivative of (-)-5 -desmethyl-yatein produced by 
endogenous tobacco enzymes. Thus, CYP71CU1 
likely catalyzes the next pathway step as part of 
E-ring functionalization. 

To complete the biosynthesis of (-)-yatein, a 
proposed intermediate in the podophyllotoxin 
pathway {21), we tested Phexl3114 (OMTl) for 
the ability to methylate (-)-5'-desmethyl-yatein. 
We infiltrated (-)-matairesinol into tobacco leaves 
expressing OMTl in combination with CYP719A23, 
OMT3, and CYP71CU1. (-)-5'-Desmethyl-yatein 
could not be detected in leaf extracts in which 



OMTl had been coexpressed (Fig. 3 and fig. S14); 
instea4 we detected the accumulation of (-)-yatein. 
Thus, OMTl likely converts (-)-5 -desmethylyatein 
to (-)-yatein as the seventh step in the pathway. 

The remainder of the pathway involves closing 
the central six-membered ring in the arylte- 
tralin scaffold and oxidative tailoring. In our 
initial screen (coexpression with CYP719A23 and 
OMT3), we observed substantial consumption of 
(-)-5 -desmethoxy-yatein in samples coexpressing 
Phex30848 (2-ODD). Computational comparison 
of leaf metabolomes revealed a new 2-ODD- 
dependent compound mass signal that corre- 
sponds to 5'-desmethoxy-deoxypodophyllotoxin 
bearing the required aryltetralin scaffold (assign- 
ment supported by MS/MS analysis) (fig. S14). We 
hypothesize that the reaction mechanism involves 
activation of the 7 carbon by hydroxylation, fol- 
lowed by dehydration and carbon-carbon bond 
formation via a quinone methide intermediate 
(fig. S16). 

Prior feeding studies {21) and our P. hexan- 
drum wounding metabolomics data suggest that 
(-)-yatein is the native substrate for ring clo- 
sure. Therefore, we tested whether 2-ODD could 
also catalyze the conversion of (-)-yatein to 
(-)-deoxypodophyilotoxin inplanta. We expressed 
2-ODD in tobacco leaves along with CYP719A23, 
OMT3, CYP71CU1, and OMTl. Four days after 
Agrohacterium infiltration, we infiltrated leaves 
with (-)-matairesinol and, a day later, harvested 
them for LC-MS analysis. We observed that (-)- 
yatein was consumed in a 2-ODD-dependent 
fashion, and a computational comparison of me- 
tabolite extracts confirmed the accumulation of 
(-)-deoxypodophyllotoxin in tobacco leaves co- 
expressing 2-ODD (Fig. 3 and fig. S17). Thus 2- 
ODD catalyzes oxidative ring closure to establish 
the core of the aryltetralin scaffold. 

We sought to confirm the activities of these 
enzymes by biochemical analysis. We isolated 



microsomes enriched with Phex524 (CYP71CU1) 
after expression in Saccharomyces cerevisiae WATLl, 
and purified Phexl3114 (OMTl) and Phex30848 
(2-ODD) with C-terminal hexahistidine tags after 
expression in E. coli. As expected, incubation of 
(-)-5'-desmethoxy-yatein with CYP71CU1 and the 
reduced form of nicotinamide adenine dinucleotide 
phosphate (NADPH) gave the hydroxylated product, 
(-)-5 -desmethyl-yatein; incubation with CYP71CU1 
and OMTl, and with the cofactors, NADPH and S- 
adenosylmethionine, gave (-)-yatein (fig. S18). In- 
cubation of 2-ODD with (-)-yatein as the substrate 
in the presence of 2-oxoglutarate and Fe^^ yielded 
(-)-deoxypodophyllotoxin. All enzymes showed 
little to no activity on similar substrates under 
identical assay conditions. These data confirm the 
enzyme activities and order of reactions for the 
pathway through (-)-deoxypodophyllotoxin (Fig. 
3 and fig. S19). 

To identity the enzyme involved in what 
we hypothesized to be the final step of (-)- 
podophyllotoxin biosynthesis, the hydroxylation 
of (-)-deoxypodophyllotoxin, we returned to the 
publicly available transcriptome data to identify 
CYPs predominantly and highly expressed in P. 
hexandrum rhizomes, the tissue in which (-)- 
podophyllotoxin is primarily produced. We iden- 
tified six CYP candidates that matched our cri- 
terion (fig. S20). We screened the candidates 
in tobacco by individual coexpression with the 
five (-)-deoxypodophyllotoxin biosynthetic genes, 
starting from CYP719A23 and infiltration of 
(-)-matairesinol. By a comparative metabolo- 
mic analysis, we observed consumption of (-)- 
deoxypodophyllotoxin in leaves coexpressing the 
candidate enzyme, Phl4372 (CYP71BE54), but— 
contrary to our expectation— no (-)-podophyllotoxin 
was detected (Fig. 3). Instead, we observed 
CYP71BE54-dependent accumulation of two com- 
pound mass signals with predicted molecular 
formulas and MS/MS data that correlate to 
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compounds derived from the demethylation of 
(-)-deoxypodophyllotoxin, formally (-)-4'-desmethyl- 
deoxypodophyllotoxin (fig. S21). The earlier elut- 
ing mass signal is likely derived from a glycosylated 
derivative. The observed activity of CYP71BE54 
implies that the demethylated lignans found in 
P. hexandrum (22-24) are a result of enzymatic 
demethylation rather than the failure of OMT3 
to methylate a portion of the lignan flux. Con- 
sistent with this view, CYPTlCUl-enriched micro- 
somes cannot accept (-)-pluviatolide as a substrate, 
which indicates a need for fully methylated sub- 
strate earlier in the pathway. Despite poor expres- 
sion in yeast, isolated CYP71BE54 microsomes 
accepted (-)-deoxypodophyllotoxin as a substrate 
but not other similar molecules (fig. S22). 

Upon screening an additional candidate P450, 
Ph35407 (CYP82D61), we also observed consump- 
tion of (-)-deoxypodophyilotoxin. However, we did 
not detect formation of (-)-podophyllotoxin; 
instead, we observed accumulation of its epimer, 
(-)-epipodophyllotoxin (fig. S23). To confirm the 
activity of CYP82D61 in the context of the late path- 
way enzymes, we infiltrated (-)-matairesinol into 
tobacco leaves expressing CYP71BE54, CYP82D61, 
and the five (-)-deoxypodophyllotoxin biosynthet- 
ic genes starting from CYP719A23. Comparative 
metabolomics demonstrated the accumulation of 
(-)-4'-desmethylepipodophyllotoxin, along with 
two other earlier eluting compound mass signals 
that are likely derived from glycosylated (-)-4'- 
desmethylepipodophyllotoxin derivatives (Fig. 3 
and fig. S24). (-)-4 -Desmethyiepipodophyilotoxin 
is the direct precursor to etoposide, which cur- 



rently is made by chemical modification of podo- 
phyllotoxin. Potent topoisomerase activity of 
etoposide was discovered by serendipitous deriv- 
atization of trace amounts of (-)-4'-desmethyl- 
epipodophyllotoxin glucoside, present in P. 
hexandrum rhizome extracts (8). 

Having discovered six enzymes that complete the 
pathway to (-)-4'-desmethylepipodophyllotoxin, 
we then sought to reconstitute the pathway in N. 
hmtimmiaria from (+)-pinoresinol. We expressed 
DIR, PER, SDH, CYP719A23, and the six enzymes 
that we identified in tobacco leaves and subse- 
quently infiltrated 100 pM (+)-pinoresinol, yield- 
ing 10.3 ng of (-)-4 -desmethyiepipodophyilotoxin 
per mg of plant dry weight. The total amount 
produced is likely even higher, as some of the 
product is derivatized by tobacco enzymes and 
could not be quantified. Less than 1 ng of product 
per mg of plant dry weight was obtained without 
infiltration of (+)-pinoresinol, which suggested 
that native production of this intermediate in 
tobacco is limiting (Fig. 3, B and C). We also pro- 
duced (-)-deoxypodophyllotoxin and (-)-epipo- 
dophyllotoxin starting from (+)-pinoresinol in 
N. henthamiana by omitting CYP71BE54 and 
CYP82D61, and CYP71BE54, respectively (figs. 
S25 and S26); pathway intermediates do not ac- 
cumulate in either case (fig. S27). The yield of 
(-)-deoxypodophyllotoxin in tobacco (~90 ng/mg 
dry weight) is more than one-third of the yield 
from wound-induced leaves of Podophyllum. 

Thus, the etoposide aglycone, (-)-4 -desmethyi- 
epipodophyllotoxin, can be produced in N. 
henthamiana, which circumvents the current 



need for mayapple cultivation and subsequent 
semisynthetic epimerization and demethylation 
(fig. S28). By coupling transcriptome mining with 
combinatorial expression of candidate enzymes 
in tobacco, we identified six biosynthetic enzymes, 
including a 2-ODD that catalyzes the novel C-C 
bond-forming step for stereoselective cyclization 
to close the aryltetralin scaffold and a late-stage 
P450 to unmask the E-ring phenol. A similar ap- 
proach could be used to engineer synthetic path- 
ways that produce podophyllotoxin derivatives 
with improved bioactive properties. 
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ANTIVIRAL IMMUNITY 

Viruses transfer the antiviral second 
messenger cGAMP between cells 

A. Bridgeman,^ J. Maelfait,^ T. Davenne/ T. Partridge,^ Y. Peng,^ A. Mayer, ^ T. Dong,^ 

V. Kaever,^ P. Borrow,^ J. Rehwinkel^* 

Cyclic GMP-AMP synthase (cGAS) detects cytosolic DNA during virus infection and induces 
an antiviral state. cGAS signals by synthesis of a second messenger, cyclic GMP-AMP 
(cGAMP), which activates stimulator of interferon genes (STING). We show that cGAMP is 
incorporated into viral particles, including lentivirus and herpesvirus virions, when these 
are produced in cGAS-expressing cells. Virions transferred cGAMP to newly infected cells 
and triggered a STING-dependent antiviral program. These effects were independent of 
exosomes and viral nucleic acids. Our results reveal a way by which a signal for innate 
immunity is transferred between cells, potentially accelerating and broadening antiviral 
responses. Moreover, infection of dendritic cells with cGAMP-loaded lentiviruses 
enhanced their activation. Loading viral vectors with cGAMP therefore holds promise for 
vaccine development. 



T ype I interferons (IFNs) play pivotal roles in 
the immune response to vims infection (i). 
IFN expression is induced by signaling path- 
ways activated by sensors of vims presence, 
including cytosolic DNA sensors (2, 3). Cy- 
clic GMP-AMP synthase (cGAS) is a cytosolic DNA 
sensor that signals by catalyzing the synthesis of 
a second messenger, cyclic GMP-AMP (cGAMP) 
(4, 5). cGAMP binds to and activates stimulator 
of interferon genes (STING) (5, 6), which plays a 
central role in cytosolic DNA sensing by relaying 
signals from DNA sensors to transcription fac- 
tors driving /FAT gene transcription (3, 7). 

DNA viruses and retroviruses trigger cGAS- 
dependent IFN responses in infected cells {8-lS). 
This is thought to involve sensing by cGAS of 
viral DNA, leading to IFN gene transcription in 
the same cell where DNA detection occurred or 
in neighboring cells connected by gap junctions 
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{16). However, it is conceivable that IFN induc- 
tion upon virus infection could also occur in- 
dependently of cGAS if the infecting virus were 
to incorporate and transfer the cGAMP second 
messenger. For example, human immunodefi- 
ciency vims 1 (HIV-1) particles incorporate host 
molecules such as APOBEC3G (77). Given this 
precedent, we hypothesized that cGAMP can be 
packaged into virions and elicits an IFN response 
in newly infected cells independently of cGAS ex- 
pression by the latter, allowing for potentiation 
of innate antiviral immunity. 

To test this idea, we produced HIV-l-based 
lentiviral vectors by plasmid transfection in 293T 
cells, a human cell line that does not express 
cGAS (4). Vims particles were pseudotyped with 
vesicular stomatitis vims glycoprotein (VSV-G), 
and the viral genome contained enhanced green 
fluorescent protein (EGFP) in the Env open read- 
ing frame. These vimses, henceforth referred to 
as HIV-l-GFP, are replication incompetent due 
to the lack of functional Env. Some 293T cells 
were cotransfected with expression constructs 
for either wild-type mouse cGAS (m-cGAS) or 
catalytically inactive m-cGAS-G198A/S199A 
(m-cGAS-AA) (4). Titrated vims stocks were then 
used to infect fresh human embryonic kidney 
293 (HEK293) cells, which express endogenous 



STING (7, 18) and induce IFN in response to 
cGAMP (fig. SI, A to D). HIV-l-GFP collected 
from cGAS-expressing cells triggered induction 
of an IFN^ promoter reporter, whereas viruses 
produced in the absence of exogenous cGAS or 
in the presence of mutant cGAS did not (Fig. lA). 
Next, we analyzed IFN secretion by transferring 
supernatants from infected cells to a reporter cell 
line (HEK293-ISRE-luc), in which firefly lucifer- 
ase expression is driven by interferon-stimulated 
response elements (ISREs) (fig. SIE). Only virus 
stocks produced in wild-type cGAS-expressing 
cells triggered IFN secretion (fig. SIF). Moreover, 
infected cells induced IFI44 and IFITl mRNAs 
specifically when cGAS was present in vims pro- 
ducer cells, further demonstrating induction of 
interferon-stimulated genes (ISGs) (fig. SIG). We 
made similar observations when infecting the 
myeloid cell line THPl (fig. S2). Next, we infected 
primary mouse bone marrow-derived macro- 
phages (BMDMs). IFN and ISGs were induced in 
BMDMs that had been infected with HIV-l-GFP 
produced in cGAS-reconstituted 293T cells (Fig. 1, 
B and C). STING-deficient BMDMs did not in- 
duce IFN and ISGs in response to the same vims 
preparations, although retinoic acid-inducible 
gene I (RIG-I)-dependent IFN production trig- 
gered by Sendai vims (SeV) was normal (Fig. 1, B 
and C). The increased IFN production triggered 
by HIV-l-GFP was functionally relevant, because 
infection with HIV-l-GFP produced in the presence 
of cGAS conferred a STING-dependent antiviral 
state against subsequent challenge with encepha- 
lomyocarditis vims (EMCV) or herpes simplex 
vims 1 (HSV-1) (Fig. ID). 

To exclude the possibility that transfer of 
plasmid DNA or of a soluble factor accounts 
for IFN production by freshly infected cells, 
we treated virus preparations with deoxyribo- 
nuclease or pelleted virions by centrifugation. 
Neither treatment affected the ability of HIV-l- 
GFP produced in cGAS-expressing cells to in- 
duce IFN (fig. S3, A and B). The IFN response 
in target cells was independent of reverse tran- 
scription and integration, as shown by pharmaco- 
logical inhibition with nevirapine and raltegravir, 
respectively (fig. S3C). Vims-like particles lacking 
the viral RNA genome induced IFN in target cells 
when collected from cGAS-expressing produc- 
er cells (fig. S3D). These observations demon- 
strate that neither the viral genome nor its 
reverse transcription products account for IFN 
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Fig. 1. HIV-l-GFP produced in cGAS-reconstituted 293T cells induces 

IFN. (A) HEK293 cells were transfected with pl25-F-Luc (/F/Vp promoter 
reporter) and pRL-TK. After 6 to 8 hours, cells were infected [multiplicity of 
infection (MOI) = 1] with HIV-l-GFP from producer cells expressing cGAS as 
indicated. F-Luc activity was analyzed after 24 hours and normalized to R-Luc. 
m-cGAS-AA is a catalytically inactive mutant (N = 2 biological replicates, 
average and individual values are shown). (B) BMDMs of the indicated geno- 
types were infected with HIV-l-GFP (MOI = 5) or SeV (wedges: MOI=l, 0.5, 0.1). 
Supernatant was tested after 24 hours for mIFNa by enzyme-linked immuno- 
sorbent assay (ELISA) (n.d.: not detectable: dashed line: lower limit of de- 
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tection; N = 2 biological replicates, average and individual values are shown). 
(C) BMDMs were infected as in (B) and the indicated mRNAs were quantified 
relative to GAPDH mRNA by reverse transcription-quantitative polymerase 
chain reaction (RT-QPGR) (N = 4 replicates: error bars indicate SD, and sig- 
nificance was determined by analysis of variance (ANOVA): ****p < 0.0001; 
***p < 0.001; **P < 0.01. (D) Lung fibroblasts were treated with IFN-A/D or 
infected with HIV-l-GFP (MOI = 2). Gells were then infected with EMGV or 
HSV-1 (wedges: fourfold dilutions starting at MOI = 4 or MOI = 64, respectively). 
After 24 hours, cells were stained with crystal violet. Data are representative 
of three or more independent experiments. 



induction in this setting. Substitution of VSV-G 
with thogotovirus glycoprotein did not dimin- 
ish the IFN-inducing property of virus stocks 
from cGAS-expressing cells, showing that these 
effects are not related to VSV-G pseudotyping 
(fig. S3E) (19). 

HIV-l-GFP stocks are likely to contain exosomes 
and other enveloped vesicles such as apoptotic 
bodies. Treatment of cGAS-reconstituted producer 
cells with the exosome inhibitor GW4869 (20) 
or the caspase inhibitor Ac-DEVD-CHO during 
virus preparation did not affect IFN induction 
by HIV-l-GFP (Fig. 2A). Reduced amounts of the 
exosome marker syntenin (21) in virus prepara- 
tions and diminished cleavage of caspase 3 in virus 
producer cells confirmed the efficacy of these com- 
pounds (Fig. 2B). We then separated virions from 
exosomes and other extracellular vesicles by den- 
sity gradient centrifugation (22) and found that the 
fraction containing most of the viral p24 protein 
and of VSV-G was also the most potent fraction 
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inducing IFN (Fig. 2C). Next, we transfected 293T 
cells with cGAS plasmid alone and prepared mock 
‘Virus” stocks. These preparations did not induce 
detectable amounts of IFN (Fig. 2D), suggesting 
that vesicles constitutively shed by cells do not ac- 
count for IFN induction. Moreover, overexpres- 
sion of tetherin, which inhibits virion release 
(23), diminished IFN induction (Fig. 2E). To- 
gether, these observations show that most of the 
IFN-inducing activity is associated with virions. 

Next, we prepared small-molecule extracts 
from HIV-l-GFP, which we then added to PMA 
(phorbol 12-myristate 13-acetate)-differentiated 
THPl cells that were mildly permeabilized with 
digitonin (fig. S4). HIV-l-GFP extracts collected 
from wild-type cGAS-reconstituted producer 
cells induced IFN secretion, whereas HIV-l-GFP 
extracts collected from catalytically inactive 
m-cGAS-AA-reconstituted producer cells failed to 
induce IFN secretion (Fig. 3A). Preincubation of 
extracts with snake venom phosphodiesterase I 



(SVPDE), which cleaves cGAMP (24), abrogated 
this effect (Fig. 3B). To further test whether 
cGAMP is present in lentiviral particles, we trans- 
fected virus producer cells with biotin-labeled 
cGAMP. Biotin-cGAMP was detectable in virus 
preparations (Fig. 3C). Next, we quantified the 
amount of cGAMP present in HIV-l-GFP prepa- 
rations using mass spectrometry. cGAMP was de- 
tectable only in extracts from virus produced in 
cGAS-expressing cells (Fig. 3D). Based on a 
calibration curve and the efficiency of extrac- 
tion (fig. S5), we found that 2.50 x 10"^^ mol of 
cGAMP were present in virus stocks per in- 
fectious unit (Fig. 3D). A similar estimate was 
obtained from the data in Fig. 3A. 

These results show that cGAMP can be pack- 
aged into HIV-l-GFP and induces IFN via STING 
in newly infected cells. Similarly, we found that 
an HIV-2-based, replication-deficient lentivirus 
produced in cGAS-expressing cells induced IFN 
secretion in HEK293 cells (fig. S6). To further 
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Fig. 2. IFN induction triggered by HIV-l-GFP from 
cGAS-expressing cells is mediated by virions. (A, C 

to E) HEK293 cells were infected for 24 hours and then 
washed: after an additional 48 hours, secreted IFN was 
analyzed by bioassay. (A) HIV-l-GFP was produced in 
cells reconstituted with m-cGAS that were also treated 
with 20 |iM GW4869 or 60 |iM Ac-DEVD-GHO, or were 
left untreated (control). Fresh cells were infected with 
pelleted virus (wedges: 10-fold dilutions: N = 3 bio- 
logical replicates). Differences between groups were 
not statistically significant (ANOVA). (B) Pelleted vi- 
ruses from (A) (top) or whole-cell extracts (bottom) 
from virus producer cells were tested by Western blot. 

FL: full length: GL: cleaved caspase 3. (C) HIV-l-GFP 
from cGAS-expressing cells was loaded onto density 
gradients. Wedges indicate increasing iodixanol con- 
centration in 12 collected fractions. Twenty and 2 |j,l of 
each fraction and of the input were tested. p24 con- 
centrations were determined by ELISA (bottom left) and p24 and VSV-G were analyzed by Western blot (bottom right). (D) 293T cells were transfected as 
indicated to produce mock “virus” stocks (wedges: 10-fold dilutions: N = 3 biological replicates). (E) Vpu-deficient lentivectors produced in cells expressing cGAS 
and either wild-type or mutant tetherin were tested. The fraction of infected cells was determined by flow cytometry (right). N = 3 biological replicates. Error bars 
are +SD, and significance was determined by ANOVA (A and D) or unpaired t test (E) (****p < 0.0001: *P < 0.05). Data are representative of two (G) or three (A, 
B, D, and E) independent experiments. 
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Fig. 3. Small-molecule extracts from HIV-l-GFP 
generated in cGAS-reconstituted producer cells 
induce IFN and contain cGAMP. (A) Extracts from 
viruses produced in the absence of cGAS or in the 
presence of wild-type (m-cGAS) or mutant cGAS 
(m-cGAS-AA) were added to digitonin permeabilized 
THPl cells. IFN in THPl supernatants was assessed 
by bioassay. Gray wedges represent a 1:2 dilution 
series starting with extract from 10^ infectious units. 
As controls, synthetic 2'3'-cGAMP was either directly 
added to THPl cells (gray bars) or was spiked into 
medium and then included in the extraction proce- 
dure (orange bars). Black wedges represent a 1:3 
dilution series starting with 50 ng of 2'3'-cGAMP. 
Extraction efficiency (fig. S5B) was taken into account 
to estimate the amount of cGAMP per infectious unit 
(dotted line). (B) Extract from 10^ infectious units of 
HIV-l-GFP produced in the presence of cGAS was in- 
cubated with or without SVPDE for 1 hour and then ad- 
ded to digitonin-permeabilized THPl cells. IFN in THPl 
supernatants was assessed by bioassay. Wedges repre- 
sent a 1:3 dilution series. (C) HIV-l-GFP produced in the 
absence or presence of cGAS or in biotin-cG AMP- 
transfected cells was probed by dot blot for biotin 
(left). The stripped membrane was reprobed for p24 
(right). Wedges represent a 1:10 dilution series starting 
with 2 X 10® infectious units. (D) cGAMP concentration 
in samples from (A) was analyzed by mass spectrom- 
etry. n.d., not detectable. Data are representative of two 
(B) or three (A, G, D) independent experiments. 
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explore the relevance of our findings, we ex- 
tended our study from replication-deficient, 
VSV-G-pseudotyped lentiviruses to fully infec- 
tious HIV-1 bearing a native CXCR4-tropic en- 
velope glycoprotein. Human monocyte-derived 
dendritic cells (hMDDCs) infected with HIV-1 
that had been produced in 293T cells reconsti- 
tuted with wild-type cGAS induced the expres- 
sion of IFN and of CD86, encoded by an ISG 
(Fig. 4A and figs. S7 to S9). Density gradient 
centrifugation demonstrated that most of the 
CD86-inducing activity fractionated together with 
infectious virus particles (Fig. 4B). Next, we pre- 
pared extracts and found that these induced IFN 
secretion by THPl cells (Fig. 4C). Mass spec- 



trometry confirmed that cGAMP was specifically 
present in extracts from HIV-1 stocks from cGAS- 
expressing cells (Fig. 4D). These results show 
that cGAMP is not only incorporated into VSV- 
G-pseudotyped lentivectors but can also be in- 
corporated into fully infectious HIV-1. However, 
these experiments relied on overexpression of 
cGAS in virus producer cells. We therefore asked 
whether triggering of endogenous cGAS dur- 
ing virus infection results in the packaging of 
cGAMP into progeny virus particles. We used 
mouse cytomegalovirus (MCMV), an enveloped 
DNA virus, for these experiments (fig. SIO) be- 
cause this virus induces STING-dependent IFN 
responses (25). We propagated MCMV in primary 



mouse embryonic fibroblasts (MEFs). Extracts 
from MCMV stocks collected from wild-type cells 
contained an activity that induced IFN secretion 
and ISG expression in THPl cells, whereas ex- 
tracts from virus from cGAS-deficient cells did 
not (Fig. 4, E and F). Mass spectrometry confirmed 
the presence of cGAMP in MCMV stocks from 
wild-type cells (Fig. 4G). Similarly, virus produced 
in Sting~^~ cells contained cGAMP, whereas no 
cGAMP was detectable in virus preparations 
from cGas~^~ cells (Fig. 4G). Consistent with this 
observation, MCMV produced in cGAS-sufficient 
MEFs induced IFNa secretion by cGas~^~ BMDMs 
upon infection, but failed to do so in Sting~^~ 
cells (Fig. 4H). In contrast, the response to MCMV 
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Fig. 4. Fully infectious HIV-1 
and MCMV incorporate 
cGAMR (A) hMDDCs from two 
blood donors were infected with 
HIV-1 produced in the presence 
of wild-type (m-cGAS) or 
mutant cGAS (m-cGAS-AA). 

GD86 and p24 expression were 
determined by flow cytometry: 

MFI: mean fluorescence intensity 

(N = 2 infections per donor; average and individual values are shown). 

(B) HIV-1 produced in the presence of wild-type cGAS was fractionated on 
density gradients and analyzed as in (A). GD86 MFI in uninfected (u.i.) cells was 
set to 1 (/V = 2 infections per donor; average and individual values are shown). 

(C) Extracts from HIV-1 were tested as in Fig. 3A. wedges: threefold dilutions 
(N = 2 biological replicates: average and individual values are shown). (D) 2'3'- 
cGAMP concentration in samples from (G) was analyzed by mass spectrometry, 
n.d., not detectable. (E and F) MGMV was propagated in primary MEFs of the 
indicated genotypes. Virus extracts or 2'3'-cGAMP was added to permeabilized 
THPl cells. Secreted IFN was assessed by bioassay [(E), N = 2 biological 
replicates: average and individual values are shown] and the indicated mRNAs 
were quantified by RT-QPGR relative to GAPDH mRNA [(F), N = A replicates]. 
Wedges represent a 1:3 dilution series starting with extract from 250,000 in- 
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from cGAS-deficient producer cells required both 
cGAS and STING. Furthermore, lEN^, ISG, and 
chemokine mRNA up-regulation after infection 
with MCMV from wild-type producer cells was 
partially cGAS-independent but fully STING- 
dependent (Fig. 41). Taken together, these results 
demonstrate that cGAMP produced by endoge- 
nous cGAS is incorporated into MCMV particles, 
which contributes to IFN induction in newly in- 
fected cells in a cGAS-independent but STING- 
dependent manner. 

Incorporation of cGAMP into virions may broad- 
en the spectrum of cells that initiate an IFN 
response. For example, HIV-1 reverse transcrip- 
tion, which is inhibited in some cells by SAMHDl 
(26), is not required for virus sensing via cGAMP 
transfer. This Trojan horse mechanism may also 
accelerate the IFN response. We speculate that 
cGAMP packaging had been overlooked in pre- 
vious studies ovdng to virus production in cGAS- 
negative cell lines such as 293T. Whether the 
incorporation of cGAMP into virus particles is 
a selective process or is based on diffusion re- 
mains to be determined. It is likely that a lipid 
envelope is required to encompass cGAMP in 
virus particles. Indeed, nonenveloped adeno- 
virus produced in cGAS-reconstituted cells did 
not induce IFN in newly infected cells (fig. Sll). 
In our experimental settings, cGAMP was pri- 
marily transferred by virions; however, these 
data do not exclude a role of extracellular ve- 
sicles such as exosomes in shuttling cGAMP be- 
tween cells in other models (for example, sterile 
inflammation). Although it remains unclear 
whether HIV virions produced during natural 
infection contain appreciable amounts of cGAMP, 
our observations have important translational 
implications. cGAMP delivery via lentiviral vec- 
tors results in heightened activation of DCs 
(figs. S8, S9, and S12). This could be harnessed 
in vaccination settings in which viral vectors 
codeliver to DCs cyclic dinucleotides as innate 
stimuli, as well as viral antigens. Moreover, IFN 
induction by oncolytic viruses has been reported 
to enhance tumor killing (27) and may be ac- 
hieved by cGAMP loading. In sum, we have iden- 
tified a mechanism by which a signal for innate 
immunity is transferred between cells. 
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ANTIVIRAL IMMUNITY 

Transmission of innate immune 
sign alin g by packaging of cGAMP in 
viral particles 
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Infected cells detect viruses through a variety of receptors that initiate cell-intrinsic innate 
defense responses. Cyclic guanosine monophosphate (GMP)-adenosine monophosphate 
(AMP) synthase (cGAS) is a cytosolic sensor for many DNA viruses and HIV-1. In response 
to cytosolic viral DNA, cGAS synthesizes the second messenger 2'3'-cyclic GMP-AMP 
(cGAMP), which activates antiviral signaling pathways. We show that in cells producing 
virus, cGAS-synthesized cGAMP can be packaged in viral particles and extracellular 
vesicles. Viral particles efficiently delivered cGAMP to target cells. cGAMP transfer by viral 
particles to dendritic cells activated innate immunity and antiviral defenses. Finally, we 
show that cell-free murine cytomegalovirus and Modified Vaccinia Ankara virus contained 
cGAMP. Thus, transfer of cGAMP by viruses may represent a defense mechanism to 
propagate immune responses to uninfected target cells. 



T o protect multicellular organisms against 
viruses, it is vital that infected cells trigger 
antiviral defense responses that can be rap- 
idly transmitted to noninfected cells. Cells 
are equipped with cytosolic viral sensors 
that recognize viral infection and induce innate 
immune responses (1). The resulting innate im- 
mune responses can restrain the pathogen, re- 
pair the host, and shape an adaptive immune 
response (2). Cyclic guanosine monophosphate 
(GMP)-adenosine monophosphate (AMP) syn- 
thase (cGAS) is a cytosolic DNA sensor that syn- 
thesizes 2'3 -cyclic GMP-AMP (cGAMP), a second 
messenger that binds to the STING protein and 
activates a type I interferon (IFN) response (3-5). 
cGAS is essential to induce an antiviral re- 
sponse against several DNA viruses, as well as 
HIV (6-8). 



Dendritic cells (DCs) link innate sensing of 
pathogens to induction of adaptive immune re- 
sponses. In DCs, the cGAS-cGAMP pathway also 
activates the expression of cytokines and co- 
stimulatoiy molecules for T cell activation, similar 
to adjuvants (7, 9). In human DCs, HIV-1 normal- 
ly evades sensing by cGAS, suggesting a crucial 
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Fig. 1. cGAS lentiviral vector activates monocyte-derived DCs. (A) Reporter blue fluorescent protein 
(BFP) and CD86 expression in DCs after infection of monocytes with a lentivirus coding BFP-2A or BFP- 
2A-CGAS, in the presence or absence of Vpx. (B) CD86 expression as in (A), with titrated virus without 
Vpx and statistical analysis on the highest dose (n = 4 independent donors combined from two 
experiments, paired t test; ***P < 0.001). Bars indicate average. (C) CXCLIO production as in (A), with 
titrated virus without Vpx and statistical analysis on the highest dose (n = 4 independent donors combined from two experiments, paired t test on log-transformed 
data: **P < 0.01). (D) BFP and CD86 expression in DCs after infection of monocytes with a lentivirus coding BFP-2A or BFP-2A-cGAS or with VLPs produced in the 
presence of a nonlentiviral plasmid encoding for cGAS (PSTCD-cGAS). (E) CD86 expression and CXCLIO production as in (D) [n = 5 independent donors combined 
from three experiments, one-way analysis of variance (ANOVA) with post-hoc Tukey test for CD86 expression analysis, one-way ANOVA with post- hoc Tu key test on 
log-transformed data for CXCLIO: ***P < 0.001, **P < 0.01, *P < 0.05, ns = not significant]. Symbols reflect individual donors. 



role for this pathway in host-HIV-1 interactions 

m- 

To study cGAS function, we sought to manip- 
ulate its expression in human monocyte-derived 
DCs. We generated a lentivector expressing cGAS 
and infected monocytes before differentiating 
them into DCs (see supplementary materials and 
methods). At day 4, DCs exposed to the cGAS 
virus expressed CD86 and were activated, despite 
low transduction efficiency (Fig. lA). In con- 
trast, infection with a control lentivector ef- 
ficiently transduced DCs but did not change 
basal activation of the cells (Fig. lA). This ob- 
servation confirmed that lentivectors them- 
selves do not activate DCs (10) and suggested 
that an activating innate immune signal was as- 
sociated with infection with a cGAS-expressing 
lentivector. 

Efficient lentivector transduction of DCs requires 
the protein Vpx that alleviates the restriction to HIV 
infection imposed by SAMHDl (11-13). Omitting 
Vpx prevented efficient transduction, but DC acti- 
vation by the cGAS lentivirus was maintained 



(Fig. 1, A and B), suggesting that cGAS expres- 
sion in the target cells was not required. Type 
I IFN-inducible cytokine CXCLIO was also pro- 
duced by DCs (Fig. 1C). The resulting DCs were 
fully differentiated as positive for DC-SIGN and 
negative for CD14 (fig. SIA). The cGAS lentivector 
also activated DCs that were fully differentiated 
before infection (fig. SI, B and C). To exclude a 
low level of cGAS vector transduction, we pro- 
duced HIV-1 virus-like particles (VLPs) lacking 
the lentiviral genome, and we coexpressed cGAS 
from a nonlentiviral plasmid (fig. SID). The VLP- 
containing supernatant from cGAS-expressing 
cells activated DCs (Fig. 1, D and E), indicating 
transmission of an innate signal. 

To determine the nature of this signal, we 
fractionated viral supernatants over a 10-kD fil- 
ter. Activity was depleted from the filtrate and 
maintained in the retentate (fig. S2, A and B). 
We performed differential ultracentrifugation (14) 
to separate extracellular vesicles (EVs) released by 
cells (15) from soluble factors. Cell debris and 
large apoptotic bodies pellet first (2000 g), fol- 



lowed by medium-sized vesicles (10,000 g), and 
finally small EVs, including exosomes (100,000 g) 
(16, 17). The activity corresponded to the fraction 
that contained Gag, regardless of the presence of 
the exosome-associated proteins CD63, CD9, and 
CD81 and cytosolic syntenin-1 (Fig. 2A and fig. 
S2, C and D). This finding suggested that viral 
particles can transmit the innate signal. 

Next, we examined which components were re- 
quired. Activity was abrogated when Glu^^^-^Ala^^ 
(E225A) and Asp^^^->Ala^^^ (D227A) mutations 
were introduced to the cGAS catalytic active site 
(18) (Fig. 2, B and C). VLPs are produced by ex- 
pressing the viral proteins Gag and Pol and the 
fusogenic viral envelope protein VSV-G (vesicular 
stomatitis virus glycoprotein). Omitting expres- 
sion of Gag and Pol, VSV-G, or all three proteins 
decreased induction of CD86 and CXCLIO in DCs 
(Fig. 2, B and C, and fig. S3A). The absence of 
VSV-G led to a strong decrease in DC activation 
(Fig. 2, B and C, and fig. S3A), indicating that 
fusogenic viral protein-containing extracellular 
material is the major DC-activating factor. We 



SCIENCE sciencemag.org 



11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 1 233 



RESEARCH \ REPORTS 



A — cGAS VLPs — cGAS ADBD VLPs — EV-depleted media C 



D 



Fractions 





Concentration ' xxxx' xxxx' xxxx' xxxx' xxxx 

CO-I-CO-I- N-CON-CO N-CON-CO N-CON-CO CO-i-CO-i- 

factor of input ggcvid 




VSV-G 



+ + + 



CO 

00 

Q 

O 



loo- 

se- 

60 - 

40 - 

20 - 

0 - 







E 3000 - 



O) 

Q. 



2000 - 



^ 1000 - 
X 

o 0- 



HIV-1 CCR5-tropic + + + + 

AZT + + + + 

cGAS — — + + 

Viral entry inhibitors - + - + 



B 



▲ 



CD 

00 

Q 

o 



cGAS cGAS 



No cGAS No Gag/Pol E225A/D227A 




BFP 






Fig. 2. HIV-1 particles transfer an innate signal initiated by cGAS. (A) CD86 
expression and CXCLIO production in DCs after dose-response infections of 
monocytes with differentially fractionated supernatants containing VLPs produc- 
ed from 293FT expressing WT cGAS or an inactive cGAS mutant lacking the DNA 
binding domain (ADBD). The volume of each fraction used for infection and the 
corresponding concentration factor compared to the initial supernatant are indi- 
cated (n = 3 independent donors combined from two experiments: mean and 
SEM plotted). (B) BFP and CD86 expression in DCs after exposure of mono- 
cytes to cell-free supernatants of cells transfected with combinations of plasmids 
expressing Gag, Pol, and VSV-G together with plasmids encoding cGAS, cGAS 
E225A and D227A, or control. (C) Analysis of CD86 expression and CXCLIO 



production as in (B) (n = 6 independent donors combined from three experi- 
ments: one-way ANOVA with post-hoc Tukey test for CD86 expression, one-way 
ANOVA with post-hoc Tukey test on log transformed data for CXCLIO produc- 
tion: < 0.001, < 0.01, *P < 0.05, ns = not significant). (D) CD86 ex- 

pression and CXCLIO production in DCs after infection of monocytes with 
CCR5-tropic HIV-1 viral particles produced in presence or absence of cGAS 
and HIV-1 entry inhibitors. Azidothymidine (AZT) was added to inhibit HIV- 
1 cDNA synthesis in the target cells (n = 6 independent donors combined from 
three experiments: one-way ANOVA with post-hoc Tukey test for CD86 expres- 
sion, one-way ANOVA with post-hoc Tukey test on log transformed data for 
CXCLIO production: ***P < 0.001, **P < 0.01). Symbols reflect individual donors. 



next examined the effect of tetherin expression, 
an inhibitor of viral particle release (19). Tetherin 
inhibited viral particle release and significantly 
decreased transfer of the innate signal (fig. S3, B 
to D). To rule out the possibility that transmission 
of the innate signal was unique to VSV-G, which 
can lead to production of tubulovesicular struc- 
tures (20), we produced VLPs carrying the in- 
fluenza virus envelope proteins HlNl and H5N1 
instead of VSV-G. When produced in the pres- 
ence of cGAS, the VLPs activated DCs (fig. S4, A 
and E). VSV-G-pseudotyped gammaretroviral mu- 
rine leukemia virus particles and HIV-1 particles 
expressing the wild-type (WT) CCR5-tropic enve- 
lope protein BaL also transmitted the innate signal 
(fig. S4, B to E). Incubation of target cells with HIV- 
1 entry inhibitors disrupted transfer of the innate 
signal without impairing the response to trans- 
fected cGAMP (Fig. 2D and fig. S4F). These data 
demonstrate that fiisogenic viral particles trans- 



fer the innate signal from cGAS-expressing cells 
to DCs. 

We could not detect cGAS protein in the pel- 
leted supernatants (fig. SID and fig. S2C). We hy- 
pothesized that cGAMP, a small molecule (mass = 
675 daltons) produced in the cytosol, could be 
packaged in the viral particles and EVs because 
these structures contain cytosol from the produc- 
ing cells (16). 

If cGAMP was transferred by viral particles, 
the latter should activate a type I IFN response 
in a STING-dependent but cGAS-independent 
manner. We transfected an IFN reporter con- 
struct with or without a STING plasmid in 293FT 
cells that lack detectable cGAS expression (fig. 
SID), and we validated the assay by transfected 
synthetic cGAMP or a cGAS expression plasmid 
(Fig. 3A). VLPs produced from cGAS-expressing 
cells activated the reporter only in the presence 
of STING (Fig. 3A and fig. S5A). Supernatants 



from cGAS-expressing cells that did not produce 
VLPs were much less effective (Fig. 3A and fig. 
S5A). To further demonstrate that cGAMP was 
present in the viral particles, we used a bioassay 
based on permeabilized THP-1 cells (a monocytic 
cell line) and an IFN reporter cell line (21, 22) 
(Fig. 3B). We extracted small molecules from 
virus-producing cells and pelleted VLPs. cGAMP 
activity was detected in cells transfected with 
cGAS but not in control cells. cGAMP activity 
was also detected in the pelleted VLPs and was 
lost when we introduced the cGAS E225A and 
D227A mutations (Fig. 3B). To confirm the pres- 
ence of cGAMP in viral particles, we quantified 
the amount of cGAMP in the fractionated super- 
natants of cells expressing cGAS with or with- 
out viral particles. Expression of viral particles 
increased the quantity of pelleted cGAMP, whereas 
levels of the exosomal protein syntenin-1 were not 
affected (fig. S5, B to E). We also confirmed the 



1234 11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 



sciencemag.org SCIENCE 



RESEARCH \ REPORTS 



CO. 

Z 

LL 



B 




H no treatment 
H cGAS transfection 

H synthetic cGAMP 
+ iipofectamine 

H controi virai particies 
CD cGAS no Gag/Poi-no VSV-G 
H cGAS virai particies 




^^3000 

% z 
o 

^ o 2000- 



- io 1000- 

CD 9? 



CG4MP 

bioassay 



i 



■ producer ceiis 

■ virai particies 



control 

synthetic cGAMP 



cGAS cGAS 

E225A/D227A 











MS2 




31 j 


50£ 

^ 476.05.. 


06534.07 











cGAS viral particles 



m/z (Da) 






Fig. 3. Viral particles package and transfer cGAMP. 

(A) 293FT cells transfected with a luciferase reporter 
plasmid under control of the IFN-p promoter with or 
without a STING coding plasmid. The cells were stim- 
ulated with titrated amounts of supernatants from 
cells producing viral particles in the presence (cGAS 
viral particles) or absence (control viral particles) of 
murine cGAS and supernatants from cells express- 
ing murine cGAS (cGAS no Gag/Pol no VSV-G), stim- 
ulated with synthetic cGAMP using Iipofectamine, or 
transfected with a plasmid coding for cGAS (cGAS 
transfection) (n = 3 independent experiments: one-way 
ANOVA with post-hoc Tukey test on log-transformed 
data: ****P < 0.0001. Graph shows average ± SEM). 
The dilution factor of the wedges is threefold per 
bar. (B) cGAMP quantification in extracts coming from 
293FT transfected cells and pelleted viral particles. 
293FT cells were transfected with a lentiviral packaging 
plasmid in the presence of cGAS alone or catalytically 
inactive cGAS with E225A and D227A mutations. 
Type I IFN activity was measured after permeabilized 
PMA-differentiated THP-1 cells were exposed to syn- 
thetic 2'3'-cGAMP (left) or the benzonase-resistant 
extracts (right) {n = 3 independent experiments, ex- 
cept n = 2 for empty control. Graph shows average ± 
SEM). The dilution factor of the wedges is threefold 
per bar. (C) Liquid chromatography-tandem mass 
spectrometry analysis was used to identify cGAMP 
in extracts of pelleted cell-free EVs from cells produc- 
ing VLPs in the presence of cGAS. (Top) Extracted 
ion chromatogram (XIC) of synthetic cGAMP [mass/ 
charge ratio (m/z) = 675.05 to 675.15], full mass spec- 
trum (MSI) at 12.6 min, and tandem mass spectrum 
(MS2) cumulated (12.6 to 12.9 min) after fragmenta- 
tion of the precursor ion (m/z = 675.1) shown in the 
MSI scan. (Bottom) XIC of cGAS viral particle extract 
(m/z = 675.05 to 675.15), full mass spectrum (MSI) at 
12.6 min, and tandem mass spectrum (MS2) cumu- 
lated (12.6 to 12.8 min) after fragmentation of the pre- 
cursor ion (m/z = 675.1) shown in the MSI scan (with 
subtracted MS2 scan at 12.57 and 12.82 min). Arrows 
indicate the m/z values specific to cGAMP. 
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Fig. 4. cGAMP is packaged in enveloped viruses produced by infection, and its transfer 
by viral particles induces an antiviral response in target cells. (A) Detection of cGAMP 
in extracts of MVA-green fluorescent protein (GFP) [4 x 10® plaque-forming units (pfu)] and mCMV (2.7 x 10^ pfu). XICs were measured by ultrahigh-performance 
liquid chromatography coupled with triple quadrupole mass spectrometry in multiple-reaction monitoring mode of the target ion with m/z = 476.1. Gray traces 
show the blank before the sample. Red dashed lines indicate the retention time of cGAMP. A representative analysis is shown, and additional replicates are 
shown in fig. S8. (B) Outline of the experiment in (C). (C) Monocytes were infected with the indicated supernatants or treated with type I IFN. The resulting DCs 
were infected with a dose-response of HSV-1 encoding GFP. GFP expression was analyzed after 24 hours (n = 4 independent donors combined from two 
experiments: one-way ANOVA with post-hoc Tukey test: ***P < 0.001, ns = not significant). Symbols reflect individual donors. MOI, multiplicity of infection. 



SCIENCE sciencemag.org 



11 SEPTEMBER 2015 • VOL 349 ISSUE 6253 1235 



RESEARCH \ REPORTS 



presence of cGAMP by performing mass spec- 
trometry analysis on the extract of pelleted viral 
particles, using synthetic cGAMP for reference 
(Fig. 3C). 

Next, we sought to determine whether transfer 
of cGAMP by viral particles would occur at a 
natural level of cGAS expression. HeLa cells ex- 
press the cGAS protein and produced cGAMP 
after DNA stimulation in a cGAS-dependent man- 
ner (fig. S6, A and B). We pelleted the super- 
natants of unstimulated HeLa, DNA-stimulated 
HeLa, or HeLa transfected with VLPs coding plas- 
mids that also provide a DNA stimulus. cGAMP 
was detected in the supernatants of all DNA- 
stimulated HeLa cells, consistent with its pack- 
aging in both EVs and viral particles (fig. S6C). 
However, although HeLa-derived VLPs induced 
CXCLIO production in phorbol 12-myristate 13- 
acetate (PMA)-treated THP-1 cells, EVs from con- 
trol HeLa or DNA-stimulated HeLa cells did not 
(fig. S6D). To ascertain whether cGAMP was trans- 
ferred, we tested the supernatants in the lucif- 
erase reporter assay. HeLa-derived VLPs, but not 
EVs from DNA-stimulated HeLa, activated the 
reporter in a STING-dependent manner (fig. S6, 
E and F). Overall, these data demonstrate that 
viral particles and EVs package cGAMP produced 
by endogenous cGAS, but only viral particles can 
efficiently transfer the second messenger cGAMP. 

Transfected DNA provides a strong stimulation 
for cGAS. We sought to test whether HIV-1 infection 
itself, which generates cDNA after reverse tran- 
scription, would lead to sufficient cGAMP pro- 
duction for packaging in virus progeny. Upon 
infection of HeLa cells, cGAMP accumulation 
was observed in the cells and inhibited by reverse 
transcriptase inhibitors, indicating that cGAMP 
was produced in response to the infection (fig. S7, A 
and B). cGAMP was also detected in the super- 
natants of both transfected and infected cells 
(fig. S7, C to E). Thus, when HIV-1 infection ac- 
tivates cGAS, cGAMP is incorporated in the ex- 
tracellular material containing the viral progeny. 
Additionally, we examined the presence of cGAMP 
in two DNA viruses: murine cytomegalovirus 
(mCMV), a member of the herpesviridae family 
of which herpes simplex virus (HSV) was shown 
to stimulate cGAS (4), and Modified Vaccinia An- 
kara virus (MVA), an attenuated poxvirus that re- 
quires cGAS for sensing by murine DCs (23). MVA 
and mCMV were produced by infection of cGAS- 
expressing cells (table SI). We found that cGAMP 
was present in the extracts of sucrose-purified 
stocks of both viruses (Fig. 4A and fig. S8, A and B). 

Finally, we tested whether eliciting cGAMP 
transfer to DCs by HIV-1 particles could induce a 
functional antiviral response. We exposed mono- 
cytes to cGAMP-containing VLPs, control VLPs, 
or supernatants from cGAS-expressing cells and 
challenged them with a replication-competent 
HSV-1 (24) and a replication-defective vaccinia 
strain (Fig. 4B and fig. S8C). IFN treatment was 
used as a positive control (Fig. 4C and fig. S8D). 
cGAMP-containing VLPs, but not control VLPs 
or supernatants from cGAS-expressing cells, con- 
ferred protection against viral infections (Fig. 4C 
and fig. S8D). Collectively, our results provide ev- 
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idence that cGAMP can be transferred between 
cells by virtue of packaging within viral particles, 
defining a mechanism of innate immune signal 
transmission (fig. S9). 

The spread of innate responses is generally at- 
tributed to the production of cytokines, including 
IFNs. Effectors such as APOBEC3G can be pack- 
aged into viral particles and EVs (25-27), but 
they do not signal in the target cells. cGAMP can 
diffuse between physically connected cells via 
gap junctions (28). Viral transfer of cGAMP does 
not require a direct contact between the cells, 
which may allow distant transmission of an in- 
nate signaling molecule within the organism or 
between hosts. This process could maximize the 
rapid induction of effector responses in target 
cells and bystander uninfected cells through lo- 
cal IFN production. Similarly, immunostimula- 
tory cyclic dinucleotides naturally produced by 
bacteria can be delivered into the target cell (3). 

Our results establish that HIV-1 may generate a 
cGAMP-containing progeny in cells that express 
cGAS. However, HIV-1 escapes cytosolic innate 
sensing by cGAS and thus is unlikely to contain 
cGAMP upon natural infection (6, 10, 29, 30). 
Eliciting cGAMP transfer by HIV-1 leads to ac- 
tivation of antiviral defenses. This parallels anti- 
viral restriction factors that are inactive against the 
WT virus (31). In contrast, we find that purified 
stocks of mCMV and MVA contain cGAMP. Thus, 
cGAMP packaging is likely relevant to a large num- 
ber of viruses and vaccines that naturally stimulate 
cGAS. The contribution of virally packaged cGAMP 
as compared to other innate signals will probably 
vaiy according to the fiinctionality of cGAS in 
target cells. 

Our results indicate that cell-derived EVs that 
include exosomes can package cGAMP as well. 
EVs also package and transmit cellular RNA be- 
tween cells (32, 33). However, transmission of 
cGAMP by cellular EVs is not efficient. Viral en- 
velope proteins increase transmission, indicating 
that fusion of EVs from noninfected cells with 
target cells is limiting. Nonetheless, low-efficiency 
transmission of cGAMP by host EVs may con- 
tribute to the tonic IFN response (8, 34). 

Packaging of cGAMP within viral particles can 
be interpreted as an immune tagging process, al- 
lowing infected cells to further signify progeny 
viruses as nonself, or danger, to alert subsequent 
target cells. We speculate that other signaling mol- 
ecules are also packaged and disseminated by 
viral particles. Subverting viral particles with 
cGAMP constitutes an attractive approach for 
therapeutics and vaccines. 
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CHROMOSOMES 

The inner centromere-shugoshin 
network prevents 
chromosomal instability 

Yuji Tanno,^ Hiroaki Susumu,^’^ Miyuki Kawamura,^’^ Haruhiko Sugimura,^ 

Takashi Honda/ Yoshinori Watanabe^’^* 

Chromosomal instability (CIN) is a major trait of cancer cells and a potent driver of 
tumor progression. However, the molecular mechanisms underlying CIN still remain 
elusive. We found that a number of CIN^ cell lines have impairments in the integrity of 
the conserved inner centromere-shugoshin (ICS) network, which coordinates sister 
chromatid cohesion and kinetochore-microtubule attachment. These defects are caused 
mostly by the loss of histone H3 lysine 9 trimethylation at centromeres and sometimes 
by a reduction in chromatin-associated cohesin; both pathways separately sustain 
centromeric shugoshin stability. Artificial restoration of the ICS network suppresses 
chromosome segregation errors in a wide range of CIN^ cells, including RB- and 
BRCAl-deficient cells. Thus, dysfunction of the ICS network might be a key mechanism 
underlying CIN in human tumorigenesis. 



M ost cancer cells exhibit elevated chro- 
mosome mis-segregation, called whole- 
chromosomal instability, as well as 
structural changes in their chromosomes 
(e.g., translocations, deletions). These 
chromosome aberrations are believed to allow 
cells to rapidly acquire genetic changes that pro- 
mote tumorigenesis (1). Several cellular defects, 
such as replication stress (2), multipolarity of 
the spindle (3), weak cohesion (4) and increased 
stability of kinetochore fibers (5), are proposed 
to be involved in CIN in cancer cells. Although 
merotely (the attachment of a single kinetochore 
to microtubules oriented toward opposite poles) 
might be a major cause of CIN (6, 7), the un- 
derlying molecular mechanisms remain elusive. 

We assessed 17 cell lines, including tumor- and 
non-tumor-derived cell lines, for anaphase chro- 
mosome segregation errors, which were classified 
into two categories: (i) lagging chromosomes 
containing kinetochore signals, which reflect at- 
tachment defects, and (ii) acentric lagging chro- 
mosomes and chromosome bridges, which reflect 
structural defects (Fig. lA). The ratio of attach- 
ment and structure defects in CIN cells deviates 
somewhat from earlier results, in part because 
CIN cells are not isogenic (7-9). Intriguingly, al- 
though widely used HeLa cell lines (HeLa-2 and 
HeLa-3) showed segregation errors, a variant 
HeLa cell line (HeLa-1) showed few errors in 
anaphase (Fig. lA). On the basis of the frequency 
of segregation errors, we define cell lines that are 
error-prone as CIN^ and those that are error-free 
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as CIN". Consistent with a previous observation 
(5), the majority of our CIN^ cell lines showed 
increased kinetochore-microtubule fibers as com- 
pared with CIN" cell lines (correlation coefficient 
r = 0.40) (Fig. IB), whereas these lines have an 
intact spindle checkpoint (10) and show some cor- 
relation (r = 0.28) with spindle multipolarity (3) 
(fig. SI, A and B). Thus, our results favor the notion 
that microtubule attachment to kinetochores is 
abnormally stabilized in CIN^ cell lines (5, 6). 

Because mutations affecting sister chromatid 
cohesion might also underlie CIN (4, 11), we ex- 
amined centromeric cohesion in the same col- 
lection of cell lines. Sister chromatid cohesion 
at metaphase is largely normal in CIN^ cells (7). 
However, when cells were arrested at metaphase 
with the proteasome inhibitor MG132, most CIN^ 
cells exhibited premature sister chromatid sepa- 
ration to various extents, whereas CIN" cells large- 
ly retained cohesion (Fig. 1C and fig. SIC); these 
findings reveal that “cohesion fatigue” (12) is a 
cohesion defect closely related to CIN (r = 0.54). 
Together, our analyses indicate that stabilized 
kinetochore-microtubule attachment and decreased 
centromeric cohesion are shared traits of CIN"^ 
cells, which suggests that these defects are caused 
by a common (but as yet unknown) abnormality 
in cancer cells. 

Sister chromatid cohesion and kinetochore- 
microtubule attachment are regulated by shugoshin 
(SCO) and the Aurora B-containing chromosomal 
passenger complex (CPC), core components of 
the inner centromere-shugoshin (ICS) network 
(i3-i5). Therefore, we hypothesized that the ICS 
network might be impaired in CIN^ cells. Al- 
though humans have two shugoshin-like proteins 
(SGOl and SG02), SGOl predominantly functions 
in both cohesin protection and CPC localization 
during mitosis (16, 17) (fig. S2A). In Hdlld (CIN") 
cells, when centromeres come under tension in 
metaphase, kinetochore-associated BUBl (a SGOl 



recruiter) is pulled outward while a substantial 
amount of SGOl remains at the inner centro- 
mere, together with the CPC (18) (Fig. ID and 
fig. S2, B to D). In HeLa-2 (CIN^) cells, however, 
SGOl largely disperses on aligned chromosomes 
with the effect of reducing the CPC from the in- 
ner centromere (Fig. ID). Although a previous 
study suggested that CPC localization is reduced 
in two cancer cell lines, particularly on unaligned 
chromosomes (19), our analyses reveal that the 
reduction in the SGOl-CPC pool is pronounced 
on aligned chromosomes in HeLa-2 (CIN^) cells 
(Fig. ID). We do not know the reason for this 
discrepancy; however, SGOl-CPC reduction on 
aligned chromosomes is a prevalent feature of our 
CIN"^ cell lines (12 of 14 cell lines) (Fig. IE and 
fig. S3). Further, our analyses (fig. S4) suggest a 
potential causal link between SGOl-CPC reduc- 
tion and CIN. Because the activity of Aurora B 
kinase at the inner centromere is required to de- 
stabilize kinetochore microtubules (13, 14) (fig. 
S5), we reason that the dispersion of the CPC 
under tension leads to the stabilization of erro- 
neous microtubule attachment, a cause of mer- 
otely (5-7). 

To further explore the root cause of SGOl-CPC 
reduction, we focused on nine cell lines that show 
dominant attachment defects (fig. S6). Given that 
SGOl interacts with the heterochromatin protein 
HPla (20), we envisaged that HPla might play an 
important role in anchoring SGOl-CPC to the in- 
ner centromere. Indeed, HPla localizes to the 
inner centromere in CIN" cells, whereas it is dis- 
persed during alignment in a number of CIN^ 
cells (Fig. 2A). The reduction of HPla by RNA 
interference (RNAi) in HCT116 (CIN") cells indeed 
caused SGOl-CPC dispersion on aligned chromo- 
somes and the CIN phenotype (Fig. 2B and fig. S7). 
Notably, all these CIN^ cell lines were defective 
in trimethylated Lys^ of histone H3 (H3K9me3) 
(Fig. 2C and fig. S8), which usually occupies 
centromeric chromatin and acts as a basis of 
HPla binding in CIN" cells (21). Indeed, the 
chromodomain mutant (Val^^ ^ Met) of HPla, 
whose binding affinity to methylated H3K9 is di- 
minished (22), showed reduced localization at cen- 
tromeres (fig. S9). We therefore reasoned that 
centromeric HPla binds H3K9me3 and aids stabi- 
lization of the ICS network by tethering SGOl to 
H3K9me3 (see below). 

Because members of the KDM4 subfamily (de- 
methylases of H3K9me3) are often overexpressed 
in cancer cells (23), we examined their expression 
in the CIN^ cell lines, and found an elevated ex- 
pression of KDM4s in eight of nine CIN^ cell lines 
(fig. SIO). HeLa-2 (CIN^) cells overexpressed 
KDM4C in comparison with HeLa-1 (CIN") cells 
(fig. SIO). Accordingly, the ectopic expression of 
KDM4C in HeLa-1 (CIN") cells leads to a loss of 
H3K9me3 on the mitotic centromeres, resulting 
in a reduction of HPla and SGOl-CPC on aligned 
chromosomes, which leads to CIN (fig. Sll). These 
results argue for a causal link between centro- 
meric H3K9me3 and CIN that is mediated by 
the ICS network. 

Notably, three of nine CIN^ cell lines with 
defects in microtubule attachment showed a 
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reduction in cohesin localization during G 2 
phase (fig. S12A). Considering that cohesin in- 
teracts directly with SGOl and that this in- 
teraction is required for the retention of SGOl at 
the inner centromeres (24), we confirmed that 
partial depletion of RAD21 (cohesin subunit) by 
RNAi in HCT116 cells and the H4 neuroglioma 
cell line harboring an inactivating mutation of 
STAG2 (another cohesin subunit) produced a 
marked impairment of SGOl-CPC localization in 
aligned chromosomes and chromosome segre- 
gation errors at anaphase (fig. S13, A to E). 
Cohesin-defective cells, however, retained intact 
centromeric H3K9me3 (fig. S13F), which suggests 



that cohesin and H3K9me3-HPla are separate 
pathways acting to sustain the ICS network. 

The foregoing results suggest that the stability 
of SGOl at centromeres maybe impaired in CIN^ 
cells. We performed FRAP (fluorescence recovery 
after photobleaching) experiments in four cell 
lines expressing SGOl fused to green fluorescent 
protein (GFP-SGOl). GFP-SGOl proteins exhibited 
a shorter half-recovery time in CIN"^ cells relative 
to CIN" cells (Fig. 3, A and B), implying that SGOl 
localization is destabilized in CIN^ cells. Because 
SGOl interacts directly with cohesin and HPla 
through distinct domains (20, 24), we generated 
point mutations in the SGOl protein— Thr^^^ ^ 



Ala (T346A), Val^ ^ Glu (V453E), or both (TAVE)- 
that disrupt the specific interaction with cohesin 
or HPla, respectively (20, 24). Although all mutant 
proteins fused with GFP were expressed equally 
in HeLa-1 (CIN“) cells, the centromeric localization 
was moderately impaired for the SG01-T346A 
and SG01-V453E proteins and was mostly abol- 
ished in the SGOl-TAVE protein (Fig. 3C). Even 
in CIN“ cells, mutant GFP-SGOl proteins exhi- 
bited a shorter half-recovery time (Fig. 3D), im- 
plying that the interactions of SGOl with cohesin 
or HPla act independently but cooperatively to 
stabilize SGOl at the inner centromeres (Fig. 3E). 
Therefore, although the cohesin and H3K9me3- 





Fig. l.The ICS network underlies CIN. (A) Left: A classification of anaphase 
chromosome segregation errors. Right: Percentage of anaphases showing 
chromosome segregation errors. (B) Cold-stable kinetochore microtubules in 
metaphase were quantified. (C) Cells showing >50% sister chromatid sep- 
aration were counted after treatment with MG132 nocodazole. A total of 
100 cells or microtubules were counted in each experiment: error bars, SD from 
three independent experiments. (D) The indicated cell lines were immuno- 
stained in prometaphase or metaphase. Kinetochore pairs on aligned or un- 



aligned chromosomes are magnified. A schematic diagram shows SGOl-CPC 
(Aurora B) localization in CIN“ and CIN^ cells. (E) Quantification of SGOl 
(top) and Aurora B (bottom) enrichment in the inner centromeres. The in- 
tensity of the signal at the inner centromere was divided by the intensity at 
the chromatin region: n = 50 centromeres. Error bars, SEM from three in- 
dependent experiments. *P < 0.05, **P < 0.01, ***P < 0.001, one-way anal- 
ysis of variance (ANOVA) with Bonferroni’s multiple comparisons test relative 
to HCT116. Scale bars, 5 |im. 
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Fig. 2. HPl-H3K9me3 sustains the 
ICS network. (A) HeLa-1 (CIN“) and 
HeLa-2 (CIN^) cells were immunos- 
tained with the indicated antibodies. 
For quantification of HPla enrichment 
in centromeres at metaphase, the 
intensity of the signal at the inner 
centromere was divided by the 
intensity on the chromatin region: 
n = 50 centromeres. Error bars, 

SEM from three independent 
experiments. (B) Percentage 
of anaphases showing attachment 
defects. Error bars, SD from 

three independent experiments. (C) Quantification of H3K9me3 enrichment in centromeres in nocodazole- 
arrested cells. The signal intensity at the centromere was divided by the intensity at the chromosome 
arm region: n = 50 centromeres. Error bars, SEM from three independent experiments. *P < 0.05, **P < 
0.01, ***P < 0.001, one-way ANOVA with Bonferroni’s multiple comparisons test relative to HCT116 [(A) 
and (C)] or t test (B). Scale bars, 5 |im. 




HPla pathways may regulate global chromatin 
functions such as transcriptional regulation (25, 26), 
our results argue that these pathways act to 
suppress CIN by stabilizing SGOl at the centro- 
meres. Further, we examined the ICS network in 
cells defective in two well-known tumor sup- 
pressor genes, RB or BRCAl, and found that they 
are required for the integrity of cohesin and/or 
H3K9me3 (figs. S12B and S14; also see below). 

Our immunostaining assays (Fig. 2C and 
figs. S12 and S14F) indicate that CIN^ cells with 
predominant defects in kinetochore-microtubule 
attachment can be classified into three catego- 
ries: (i) H3K9me3-defective, (ii) cohesin-defective, 
and (iii) doubly defective cells (Fig. 4). To ex- 
amine whether the reduction of centromeric 
H3K9me3 and/or chromatin-associated cohesin 
in CIN^ cells is the cause of CIN, we sought to 
restore these defects by tethering H3K9 trime- 
thyitransferase Suv39Hl to centromeres (CENP-B 
fiision; Suv39-CB) and/or by expressing a cohesin 
stabilizer, sororin (27) (Fig. 4A and fig. S15). 
Strikingly, the expression of Suv39-CB suppressed 
CIN in four of five H3K9me3-defective CIN^ 
cells but not in cohesin-defective RB-reduced 
HCT116 (CIN^) cells (Fig. 4, A and B). In con- 
trast, the expression of sororin suppressed CIN 
in RB-reduced HCT116 (CIN^) cells but not in 
H3K9me3-defective CIN^ cells, validating the 
specificity of the assay. Finally, two of three CEST 
cell lines defective in both the H3K9me3 and co- 
hesin pathways were suppressed for CIN by the 
expression of both Suv39-CB and sororin, but not 
either alone (Fig. 4, A and B). We confirmed that 
CIN suppression is in each case accompanied 
by the restoration of the ICS network (fig. S15). 




A Before 
bleach 






Fig. 3. HPl and cohesin cooperate 
to sustain the ICS network. (A) 

Nocodazole-treated cells expressing 
GFP-SGOl were examined by FRAP 
analysis. (B) Left: Recovery of the 
fluorescence signal intensity after 
photobleaching (t = 0) relative to 
the prebleached intensity (100%): 
n = 5 cells (HGT116, HeLa-2, 293T) 
or 10 cells (HeLa-1). Right: Half- 
recovery time. Error bars, SD from 
three independent experiments. 

(C) Nocodazole-treated HeLa-1 
(GIN“) cells transiently expressing 
GFP-SGOl and its mutants were 
examined by immunoblotting and 
immunostaining. The relative local- 
ization of SGOl was examined: 
n = 50 centromeres. (D) Turnover of 
GFP-SGOl in cells prepared as in 
(0) was examined by FRAP analysis: 
n = 10 cells. (E) A schematic illus- 
tration of the lOS network. Error 
bars, SEM from three independent 
experiments. **P < 0.01, 

< 0.001, ****P < 0.001, 
one-way ANOVA with Bonferroni’s 
multiple comparisons test. 

Scale bars, 5 |im. 
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relative to non-expressing cells. (B) Suppression of ICS 
with defects (-) in H3K9me3 and chromatin-associated 



Fig. 4. Promotion of H3K9me3 and 
cohesin association with chromatin 
suppress CIN. (A) The percentage of 
anaphases showing attachment defects 
was measured in the indicated cells 
expressing HA-Suv39Hl-CENP-B and/or 
GFP-sororin. Error bars, SEM from three 
independent experiments. **P < 0.01, 

< 0.001, one-way ANOVA with 
Bonferroni’s multiple comparisons test 
network defects and CIN in various CIN^ cell lines 
cohesin. The solid box indicates the suppression. 



In smnmaiy, both ICS network defects and mi- 
totic segregation errors are rescued by restor- 
ing H3K9me3 and/or cohesin in seven of nine 
CIN^ cells, which suggests that these two path- 
ways are important CIN-susceptible targets in 
human cells. 

Our study reveals that impairment in the ICS 
network prevails among human CIN^ cell lines 
and acts as a key molecular mechanism to gen- 
erate CIN. Although numerous cellular mutations 
may influence chromosome segregation, only mu- 
tations that moderately impair chromosome seg- 
regation will establish CIN^ cells, which divide but 
exhibit genomic instability. In fact, the depletion 
of a core component of the ICS network (BUBl, 
SGOl, or the CPC) from cells provokes serious 
mitotic defects that lead to lethal mitosis, whereas 
hypomorphic mutations in the ICS network com- 
ponents cause CIN in cultured human cells and 
promote tumorigenesis in mice (28, 29). None- 



theless, mutations in components of the ICS 
network (BUBl, SGOl, CPC) are rare in human 
cancer genomes (30), implying that the path- 
ways regulating the ICS network, rather than its 
core components, might be genetically or epi- 
genetically altered during tumorigenesis. Indeed, 
we detected an ICS network defect associated 
with chromosome segregation errors in a patient 
with non-small cell lung cancer (fig. S16). Future 
studies, including the inspection of a substantial 
number of clinical samples, will validate the po- 
tential linkage between ICS network defects and 
CIN in tumors. 
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Top-Down Proteomics: 
Turning Protein Mass 
Spec Upside-Down 

Between alternative transcription start sites, alternative 
splicing, and posttranslational modifications, a given gene 
may produce dozens of protein variants, each with a different 
biological activity. Teasing apart those structure-function 
relationships requires mapping specific variants to their 
associated biological functions, and the tool of the trade for 
doing so is mass spectrometry. But not just any mass spec 
will do. Researchers need a holistic view of protein structure, 
data that is lost with the popular “bottom-up” proteomics 
strategy. Powered by today’s ultrahigh-resolution, high mass- 
accuracy mass specs, protein biochemists are increasingly 
turning bottom-up upside-down. Their new alternative: top- 
down proteomics. By Jeffrey M. Perkel 

I f you want to know which of a gene’s many variants, or 
“proteoforms,” is responsible for a particular biological ac- 
tivity, you need a way to detect that isoform directly. That’s 
easier said than done. 

Proteoform analysis is fundamentally a two-part problem. The 
first part, protein identification, is a simple question of peptide 
sequencing: matching spectral peaks to a protein’s amino acid 
sequence and thence to the gene that encoded it. This can 
be complicated if related proteins are present in a sample, be- 
cause they share identical stretches of amino acid sequences, 
but in general is relatively straightforward. 

Tougher by far is characterization. A given protein may exist 
in dozens of forms distinguished by just a few daltons, variants 
I that differ in terms of messenger RNA (mRNA) splicing, post- 
g translational proteolytic processing, and chemical modification, 
i Take histones, for instance. Histone proteins can be heavily 

^ modified by methyl, acetyl, and phosphoryl groups, among 

g others, at their N-termini, which in turn can impact chromatin 
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structure and gene expression. In 2009, University of Pennsyl- 
vania Presidential Associate Professor Benjamin Garcia (then 
at Princeton University) used a so-called middle-down strat- 
egy— in which relatively large protein fragments (bigger than 
tryptic peptides but smaller than intact proteins) are analyzed 
and sequenced in the mass spectrometer— and some clever 
chromatography to resolve and identify 70 proteoforms of hu- 
man histone H4 and 200 of human histone H3.2. 

It isn’t clear that every one of those variants has a different 
biological activity, of course. But the only way to know is to 
accurately tally them and track their changes under different 
biological conditions. And therein lies the rub. In bottom-up 
proteomics, researchers digest their proteoforms to peptides, 
separate them via liquid chromatography, and deliver them to 
the mass spectrometer. But as it cleaves the peptide back- 
bone, trypsin also destroys any chance researchers have of 
understanding how posttranslation modifications are linked. 
The enzyme can cleave the 15 kilodalton (kDA) histone H3.2 
29 times, including more than a dozen sites in the critical N- 
terminal tail. Using a bottom-up strategy effectively destroys 
information on how those individual chemical modifications 
are related, meaning researchers may be able to see that 
given modifications are present, but are largely blind to their 
interplay and stoichiometry. They certainly wouldn’t be able to 
determine if, say, two modifications were coincident or mutu- 
ally exclusive. 

In the top-down approach, the histone proteoforms are 
delivered to the mass spec intact and then sequenced by 
fragmentation inside the instrument, thereby retaining the 
critical linkage data. This is a more technically challenging 
strategy, in that intact proteins are harder to fractionate and 
fragment than peptides, and much harder to separate by liq- 
uid chromatography. Furthermore, it takes relatively high-end 
instrumentation to resolve such large molecules when they 
are so similar in size, and special software to do the analysis. 
Lysine trimethylation, for instance, increases protein mass by 
42.0470 Da, while acetylation adds 42.0106 Da, a difference of 
just 0.0364 Da. 

Still, top-down is on the upswing, says Neil Kelleher, the 
Glass Professor of Life Sciences at Northwestern Uni- 
versity, founding member of the Consortium for Top Down 
Proteomics and a top-down evangelist. At the recent annual 
meeting of the American Society for Mass Spectrometry 
(ASMS), for example, top-down accounted for “10% to 15%” 
of the conference, Kelleher says. “A decade ago, it was 
around 0.1%, or very fringe.” 

Top of the line 

One driver for the growth in top-down is the increasing 
availability of instrumentation capable of running the 
experiments. Given the need to distinguish proteins varying by 
only small chemical changes, top-down researchers typically 
use high-end, high-resolution instrumentation. Just a few 
years ago, that mostly meant top-of-the-line Fourier-transform 
ion-cyclotron resonance (FT-ICR) mass spectrometers, 
massive and complicated hardware offering continued> 
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One driver for the growth in top-down is 
the increasing availability of instrumentation 
capable of running the experiments. 



resolution values— and pricing— in the millions. Today, more 
affordable quadrupole-time-of-flight (qTOF) instruments, 
such as the Waters SYNAPT G2-Si, the Bruker maXis II, 
and the Thermo Fisher Scientific benchtop Orbitrap mass 
spectrometers, have made the technology more accessible. 

Still, for some jobs, only an FT-ICR will do. And one of the 
world’s most powerful such systems just went online at the 
Pacific Northwest National Laboratory (PNNL), in Washing- 
ton state. 

FT-ICR mass spectrometers derive their exquisite resolution 
from the massive cryo-cooled magnets that power them, and 
as magnetic field strength rises, so too does performance, 
says Ljiljana Pasa-Tolic, lead scientist for mass spectrometry 
at PNNL’s Environmental Molecular Sciences Laboratory user 
facility. Thus, with a more powerful magnet, “you can think 
about getting higher resolving power in the same acquisition 
time, or you can get equal resolving power in a shorter acqui- 
sition time.” 

The PNNL already has several FT-ICR instruments, includ- 
ing systems with 12 and 15 tesla (T) magnets. The new instru- 
ment, which went online in mid-March, clocks in at 
21 T. With a linear ion trap (Thermo Scientific LTQ-Velos) on 
the front end and an Agilent Technologies magnet on the 
back, the instrument “occupies almost the whole room; it 
[weighs] about 24 tons,” Pasa-Tolic says. The magnet itself 
requires about 4,000 L of liquid helium to maintain its working 
temperature of 2.19 Kelvin. (A second 21 T instrument, em- 
ploying a Bruker magnet, has been installed at the National 
High Magnetic Field Laboratory in Tallahassee, Florida.) 

The PNNL instrument went online in mid-March, Pasa-Tolic 
says, and preliminary data were presented at the recent 
ASMS conference. “We were able to demonstrate resolving 
power of about 8 million for 12-second transients, which is 
great,” she says. That 12-second analysis time is too slow 
for the traditional LC-MS workflow, in which proteins flow 
straight from a liquid chromatography (LC) column into the 
mass spectrometer (MS), she notes. But even at a more LC- 
compatible rate, the instrument yields resolutions of about 
1 million, she says, and further improvements are in the 



works. “We have demonstrated a resolving power ... an order 
of magnitude greater than what is attainable with currently 
available commercial technology.” 

Among other things, Pasa-Tolic hopes to use the 21 T to 
break the size barrier that bedevils top-down research. Top- 
down researchers typically struggle to characterize proteins 
larger than about 50 kDa, though some have used the tech- 
nique to tackle the posttranslational modifications of 150 kDa 
biotherapeutic antibodies. But with a more powerful magnet, 
it may be possible to routinely hit 100 kDa or more, Pasa-Tolic 
says. Indeed, her lab already presented data at ASMS dem- 
onstrating “isotopically resolved” analysis of 70 kDa proteins 
(such as intact bovine serum albumin) at high spectral-acqui- 
sition rates. 

Pasa-Tolic now plans to direct the instrument at secreted 
fungal enzymes, especially those that degrade lignocellu- 
lose. These heavily glycosylated proteins, weighing between 
50,000 and 100,000 Da, could advance biofuel development, 
and Pasa-Tolic is developing new reverse-phase chromatog- 
raphy strategies to separate them. “It would be very beneficial 
to figure out how this pattern of glycosylation relates to 
function and stability and eventually glycoengineer these en- 
zymes to be more stable and more commercially affordable,” 
she says. 

Laser focus 

Top-down proteomics is so named because intact 
proteins are separated and broken down into smaller and 
smaller pieces in the MS to determine their sequence and 
modifications. To do that, researchers can apply any of a 
number of protein fragmentation methods, and the more 
options available, the better. “You might want to have a lot 
of fragmentation tools available to really get the most out of 
the actual experiment,” says Andreas Huhmer, proteomics 
marketing director at Thermo Fisher Scientific. Thermo’s new 
Orbitrap Fusion Lumos, for instance, offers collision-induced 
dissociation (CID), in which the peptide backbone is broken by 
collision with a gas molecule, and the related higher-energy 
collisional dissociation (HOD). It also enables the popular 
electron-transfer dissociation, which uses a charged donor 
molecule to induce fragmentation, as well as hybrid methods, 
such as electron-transfer and higher-energy collision 
dissociation (EThcD). 

Jennifer Brodbelt, the William H. Wade Endowed Professor 
of Chemistry at the University of Texas at Austin, is develop- 
ing an alternative fragmentation approach. Ultraviolet pho- 
todissocation (UVPD) uses ultraviolet laser pulses to cause 
proteins to shatter along their backbone, producing a ladder 
of fragments that vary in size by a single amino acid. That’s 
how other fragmentation methods are supposed to work, too, 
but according to Brodbelt, most tend to fragment more ef- 
ficiently at protein termini or near charged residues, providing 
incomplete sequence coverage. UVPD seems to provide rela- 
tively uniform coverage across the entire sequence, at least 
for proteins up to 40 kDa, including the oft-overlooked protein 
center. “The fragmentation process does not seem to be as 
charge-modulated as those other methods,” she says. 

Brodbelt has worked with Thermo Fisher Scientific to 
implement the technique on Orbitrap instruments. In one 
recent paper, she applied the method on an Orbitrap Elite 
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to map the linkages in branched poly-ubiquitin chains. The 
result was a remarkable series of fragment ions, one for each 
consecutive amino acid of the ubiquitin chain, terminating 
at the residue to which the ubiquitin moiety is coupled. By 
simply counting those ions and watching where they abruptly 
disappeared, she could determine where the inter-ubiquitin 
linkages must have occurred. 

“You’d see a huge shift, a mass shift when ubiquitin 
appeared at a particular lysine,” Brodbelt explains. 

Though still in development, UVPD systems have been 
installed in several labs. The PNNL 21 T has one. So does 
John Yates III, the Ernest W. Hahn Professor at the Scripps 
Research Institute, who has mounted the system on an 
Orbitrap Fusion. Bottom-up proteomics, Yates explains, 
has long been considered easier than top-down in part 
because the infrastructure required to do it— the mass 
spectrometers, the peptide separation methods, and the 
analytical software— was already mature when the technique 
was developed. The experiments themselves were thus 
easier to perform. “For top-down, almost everything has to 
be reinvented or certainly significantly improved in order to 
make this whole workflow possible.” That, he says, explains 
his enthusiasm for UVPD. “Hopefully it will get us the kind 
of fragmentation that we need in order to effectively analyze 
these things.” 

From top-down to top-top-down 

As top-down adoption grows, so too do the technical de- 
velopments. One emerging area is what Kelleher calls “top- 
top-down,” or native mass spectrometry. The method allows 
researchers to examine multiprotein complexes in the MS, 
and one researcher making significant headway on this ap- 
proach is Vicki Wysocki, Ohio Eminent Scholar at Ohio State 
University. 

Existing fragmentation approaches, such as CID, simply 
cannot inject enough energy per collision into a protein 
complex to cause it to fall apart, Wysocki explains. “If you 
have a very large protein complex ...[and] if you are colliding 



that into argon with a mass of 40, the amount of energy that 
you can transfer will be fairly small.” Rather than falling apart, 
a protein in such a complex will simply unfold, she says. 

So, her group developed an alternative approach, surface- 
induced dissociation (SID), in which complexes are smashed 
at high speed into an inert fluorocarbon-coated gold surface. 

Using SID, Wysocki says, researchers can work out the to- 
pology of protein complexes and subcomplexes, teasing them 
apart to determine, for instance, which protein-protein inter- 
faces are strong and which are weak. Suppose a given com- 
plex is a hexamer, she explains— a dimer of trimers. “We will 
directly see those trimers as products of the SID,” she says. 

In one recent example, Wysocki’s team used that approach to 
work out the stoichiometry of the Pyrococcus furiosus RNAse 
P complex, an RNA-containing tetramer whose structure was 
previously unresolved. 

Waters has been working with Wysocki’s group to offer SID 
capability to selected investigators on the SYNAPT G2 series 
qTOFs, and Wysocki has received grant funding to implement 
the method on Orbitrap and FT-ICR instruments as well. She 
has also developed more elaborate implementations, includ- 
ing a modified qTOF containing two SID cells flanking Waters’ 
ion mobility separation unit, for performing multiple surface 
collision events. Ion mobility separation, Wysocki explains, 

“is sort of like a gas-phase electrophoresis,” separating ions 
by size and shape, and it “has been a huge help in all of this 
work.” 

Another emerging development is top-down-based mass 
spectrometric imaging, Pasa-Tolic says. Richard Caprioli at 
Vanderbilt University, and Ron Heeren in the Netherlands 
have both demonstrated laser ablation-based top-down strat- 
egies in the past year using FT-ICR mass analyzers, and Pasa- 
Tolic says she would like to apply such strategies to study the 
soil rhizosphere, for instance, to determine where different 
secreted enzymes are located. “If you think about the way we 
do top-down proteomics right now, it clearly is missing spatial 
information,” Pasa-Tolic says. “In many instances, this would 
be extremely useful to have.” 

As for Kelleher, he sees a bright future for top-down in clini- 
cal research. Indeed, it is in the clinic that one of top-down’s 
biggest successes can already be found. The Bruker BioTyper, 
a simple matrix-assisted laser desorption/ionization time- 
of-flight (MALDI-TOF) mass spectrometer for identification 
of bacterial pathogens based on intact protein masses, “has 
been a smash success .... Arguably one of the best suc- 
cesses of proteomics in clinical medicine,” Kelleher says. Now 
he hopes to apply that same top-down philosophy to clinical 
biomarker development for complex disease. 

With a growing user community, he won’t be alone in that 
work. But Kelleher remains undaunted. “I’m smiling,” he says. 
“Even if people are telling me they’ve done it better than my 
group has, I just say, ‘okay, great, it’s a big sandbox. Come 
play!”’ 



Jeffrey M. Perkel is a freelance science writer based in Pocatello, Idaho. 
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Validated Antibodies 

A new product line of PrecisonAb Anti- 
bodies has been rigorously validated for 
use in Western blotting. These antibod- 
ies offer superior sensitivity, specificity, 
and reliability. They also come with the 
industry’s most complete validation data 
so researchers can accurately assess the 
antibodies’ performance before buying, 
and a positive control lysate so research- 
ers can optimize the antibodies in their 
own labs. These are the first of thousands 
of PrecisionsAb Antibodies Bio-Rad plans 
to release to help researchers improve 
their Western blotting results. Scientists 
have struggled with antibody unreliabil- 
ity for decades because manufacturing 
standards and quality control vary widely 
among vendors, targets, and even among 
antibody lots. PrecisionAb Antibodies are 
screened using whole-cell lysates from 
up to 12 different biologically relevant cell 
lines— by far the industry’s most rigorous 
validation process. Only antibodies that 
detect endogenous protein levels with 
high sensitivity and specificity are includ- 
ed in the PrecisionAb product line. 

Bio-Rad 

For info: 800-424-6723 
www.bio-rad.com/precisionabpr 

Protein Pegylation Service 

Pegylation services are now offered to 
customers developing protein-based 
therapeutics and biosimilars through a 
new collaboration with Celares GmbH. 

The service offering includes feasibility 
studies, process and analytical devel- 
opment, and scale-up from milligram 
to gram quantities required for pilot 
and subsequent commercial scale. Pe- 
gylation, the attachment of polyethylene 
glycol (PEG) to a molecule, can signifi- 
cantly improve the pharmacological and 
physicochemical properties of peptide 
and protein therapeutics and reduce side 
effects. This well-recognized delivery 
system for biologies can enhance protein 
stability, bioavailability, and solubility, 
overcoming common challenges in the 
development of these therapeutics. The new services will leverage 
EMD Millipore’s broad range of functionalized PEG products of dif- 
ferent molecular weight and activation chemistry, as well as buffers, 
solvents and excipients, and unit operations employed during the 
pegylation process and subsequent purification, including tangen- 
tial and normal flow filtration and chromatography. 

EMD MMlipore 

For info: 800-645-5476 

www.emdmillipore.com 



Conjugated Antibodies 

A new range of conjugated antibodies 
leverages Innova’s Lightning-Link and 
InnovaCoat bioconjugation technologies 
to provide customers with antibodies 
conjugated to an extensive range of 
enzymes, fluorescent dyes, and colloidal 
gold for improved flexibility in experi- 
mental design. The new range comprises 
200 antibodies to nearly 30 different car- 
diac biomarkers, each available directly 
conjugated to 24 different enzymes, fluo- 
rescent dyes, or nanoparticles. In con- 
junction with the new range, a new cor- 
porate website provides customers with 
easier online ordering facilities, as well as 
access to a range of resources and sup- 
port materials. The website also features 
an Antibody Conjugate Generator tool, 
which enables customers ordering from 
the new conjugated antibodies range 
to select any desired cardiac marker, 
antibody clone, and label. The online 
Conjugate Generator provides custom- 
ers with unprecedented flexibility, speed, 
and conjugation possibilities, enabling 
highly tailored reagents to be created at 
a fraction of the cost of a custom conju- 
gation project. 

Innova Biosciences 
For info: +44-(0)-1 223-4961 70 
www.innovabiosciences.com/products/ 
antibody-conjugates 

Host Cell Protein Detection Kit 

The Gyrolab CHO-HCP Kit 1 detects 
and quantifies host cell proteins (HCPs) 
from Chinese hamster ovary (CHO) 
cells used in the bioprocessing of 
biotherapeutics. HCPs are measured 
throughout the production process, as 
they may affect the safety and efficacy 
of the biotherapeutic. The kit has been 
validated for use on the Gyrolab xP 
workstation and the new Gyrolab xPlore 
and is ready to use, eliminating the need 
for time-consuming assay development. 
The automation capabilities of Gyrolab 
systems enable 96 data points to be 
generated in approximately 1 hour without manual intervention, 
saving time and reducing errors and repeats. The kit broadens the 
analytical range from typically two orders of magnitude to four, and 
therefore requires fewer dilutions and fewer reruns than alternative 
methods. The kit contains all the reagents needed to produce 96 
data points, including a specific Bioaffy CD that has been optimized 
for this HCP assay. 

Gyros 

For info: +46-(0)-1 8-566-300 
www.gyros.com 




Protein Digestion Kit 

A new tool for protein digestion, the 
SMART Digest kit is designed to 
substantially reduce preparation times 
and associated errors, facilitating fast, 
reliable analytical results. Current protein 
digestion techniques comprise complex 
procedures requiring determination 
of protein concentration followed by 
sample reduction and alkylation prior to 
digestion with trypsin. To improve this 
operation, the kit uses a simple three-step, 
easy-to-implement process that can be 
automated. Its immobilized trypsin design 
is formulated for high-quality data with a 
significantly reduced sample preparation 
time compared to traditional in-solution 
digestion methods. The increased 
reproducibility of digestion possible with 
this kit should enable users to implement 
generic, rapid, and robust methods for 
high-throughput sample processing. 

The SMART Digest kit is also designed 
to provide higher sequence coverage 
while reducing both chemically induced 
posttranslational modifications and the 
number of chemicals in the final sample. 
Thermo Fisher Scientific 
For info: +45-70-23-62-60 
www.thermoscientific.com/smartdigest 
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Human Nutrition, Environment & Heaith (Tl) 

October 1 4-1 8, 201 5 | Beijing | China 

Diabetes: New Insights into Molecular Mechanisms 
& Therapeutic Strategies (T2) 

October 25-29, 201 5 | Kyoto j Japan 

Systems Immunology: From Molecular Networks to Human Biology (Al) 

January 1 0-1 4, 201 6 1 Big Sky, Montana | USA 

Cytokine JAK-STAT Signaling in Immunity & Disease (A2) 

January 10-14,2016 1 Steamboat Springs, Colorado | USA 

Molecular & Cellular Basis of Growth & Regeneration (A3) 

January 1 0-1 4, 201 6 1 Breckenridge, Colorado | USA 

Nuclear Receptors: Full Throttle (Jl) 

Jointwith Metabolism, Transcription & Disease (J2) 

January 1 0-1 4, 201 6 1 Snowbird, Utah | USA 

Biology of Down Syndrome: Impacts Across the Biomedical Spectrum (A4) 

January 24-27, 201 6 1 Santa Fe, New Mexico | USA 

Traumatic Brain Injury: Clinical, Pathological & Translational Mechanisms (J3) 
jointwith Akous: From Cell Biology to Pathology (J4) 

January 24-27, 201 6 1 Santa Fe, New Mexico | USA 

Drug Discovery for Parasitic Diseases (A5) 

January 24-28, 201 6 1 Tahoe City, California | USA 

Small RNA Silencing: Little Guides, Big Biology (A6) 

Jan 24-28, 2016 1 Keystone, Colorado | USA 

Purinergic Signaling (J5) jointwith Cancer Immunotherapy: 

Immunity & Immunosuppression Meet Targeted Therapies (J6) 

January 24-28, 201 6 1 Vancouver, British Columbia | Canada 

Neurological Disorders of Intracellular Trafficking (A7) 

January 31 -February 4, 201 6 1 Keystone, Colorado | USA 

Cell Biology & Immunology of Persistent Infection (A8) 

January 31 -February 4, 201 6 1 Banff, Alberta | Canada 

The Cancer Genome (Ql) 

jointwith Genomics & Personalized Medicine (Q2) 

February 7-1 1 , 201 6 1 Banff, Alberta | Canada 

Fibrosis: From Basic Mechanisms to Targeted Therapies (Q3) 
jointwith Stromal Cells in Immunity (Q4) 

February 7-1 1, 2016 1 Keystone, Colorado | USA 

Plant Epigenetics: From Genotype to Phenotype (Bl) 

February 1 5-1 9, 201 6 | Taos, New Mexico | USA 

Obesity & Adipose Tissue Biology (B2) 

February 1 5-1 9, 201 6 | Banff, Alberta | Canada 

Noncoding RNAs in Health & Disease (Q5) 
jointwith Enhancer Malfunction in Cancer (Q6) 

February 21 -24, 201 6 | Santa Fe, New Mexico | USA 

G Protein-Coupled Receptors: Structure, Signaling & Drug Discovery (B3) 

February 21-25, 2016 | Keystone, Colorado | USA 

New Frontiers in Understanding Tumor Metabolism (Q7) 

jointwith Immunometabolism in Immune Function & Inflammatory Disease (Q8) 

February 21 -25, 201 6 | Banff, Alberta | Canada 

T Follicular Helper Cells & Germinal Centers (B4) 

February 26-March 1, 2016 1 Monterey, California | USA 

Immunity in Skin Development, Homeostasis & Disease (B5) 

February 28-March 2, 201 6 1 Tahoe City, California | USA 

Tuberculosis Co-Morbidities & Immunopathogenesis (B6) 

February 28-March 3, 2016 1 Keystone, Colorado | USA 



Stem Cells & Cancer (Cl) 

March 6-1 0, 201 6 | Breckenridge, Colorado | USA 

Cancer Vaccines: Targeting Cancer Genes for Immunotherapy (XI) 
jointwith Antibodies as Drugs (X2) 

March 6-1 0, 201 6 1 Whistler, British Columbia | Canada 

Ubiquitin Signaling (X3) 

jointwith NF-kB & MAP Kinase Signaling in Inflammation (X4) 

March 1 3-1 7, 201 6 1 Whistler, British Columbia | Canada 

Islet Biology: From Cell Birth to Death (X5) 

jointwith Stem Cells & Regeneration in the Digestive Organs (X6) 

March 1 3-1 7, 201 6 1 Keystone, Colorado | USA 

Chromatin & Epigenetics (C2) 

March 20-24, 2016 1 Whistler, British Columbia | Canada 

HIV Persistence: Pathogenesis & Eradication (X7) 
jointwith HIV Vaccines (X8) 

March 20-24, 2016 1 Olympic Valley, California | USA 

Cancer Pathophysiology: Integrating the Host & Tumor Environments (C3) 

March 28-Apr 1, 2016 | Breckenridge, Colorado | USA 

Modern Phenotypic Drug Discovery: Defining the Path Forward (Dl) 

April 2-6, 2016 1 Big Sky, Montana j USA 

Mitochondrial Dynamics (D2) 

April 3-7, 201 6 1 Steamboat Springs, Colorado | USA 

Heart Failure: Genetics, Genomics & Epigenetics (Zl) 
jointwith Cardiac Development, Regeneration & Repair (Z2) 

April 3-7, 2016 1 Snowbird, Utah | USA 

Myeloid Cells (D3) 

April 1 0-1 4, 2016 | Killarney, County Kerry | Ireland 

New Therapeutics for Diabetes & Obesity (Gl) 

April 1 7-20, 201 6 | La Jolla, California | USA 

Gut Microbiota, Metabolic Disorders & Beyond (D4) 

April 1 7-21 , 2016 | Newport, Rhode Island | USA 

Epigenetic & Metabolic Regulation of Aging & Aging-Related Diseases (El) 

May 1 -5, 201 6 1 Santa Fe, New Mexico | USA 

Positive-Strand RNA Viruses (Nl) 

May 1 -5, 201 6 1 Austin, Texas | USA 

Nucleic Acid Sensing Pathways: 

Innate Immunity, Immunobiology & Therapeutics (E2) 

May 8-1 2, 201 6 | Dresden | Germany 

State of the Brain (Rl) 

May 22-26, 201 6 1 AIpbach | Austria 

New Approaches to Vaccines for Human & Veterinary Tropical Diseases (Ml) 

May 22-26, 2016 1 Cape Town | South Africa 

B Cells at the Intersection of Innate & Adaptive Immunity (E3) 

May 29-Jun 2, 2016 | Stockholm | Sweden 

Understanding the Function of Human Genome Variation (Kl) 

May 31 -Jun 4, 2016 | Uppsala | Sweden 

Autophagy: Molecular & Physiological Mechanisms (VI) 

June 5-9, 201 6 | Whistler, British Columbia | Canada 

Common Mechanisms of Neurodegeneration (Z3) 
jointwith Microglia in the Brain (Z4) 

June 1 2-1 6, 201 6 1 Keystone, Colorado | USA 

Exosomes/Microvesicles: 

Novel Mechanisms of Cell-Cell Communication (E4) 

June 1 9-22, 201 6 1 Keystone, Colorado | USA 
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Science Careers 
Advertising 

For full advertising details, go to 
ScienceCareers.org and click 
For Employers, or call one of 
our representatives. 

Tracy Holmes 

Worldwide Associate Director 

Science Careers 

Phone: +44 (0) 1223 326525 

THE AMERICAS 

E-mail: advertise@sciencecareers.org 
Fax: +1 (202) 289 6742 

Tina Burks 

Phone: +1 (202) 326 6577 

Nancy Toema 

Phone: +1 (202) 326 6578 

Online Job Posting Questions 

Phone: +1 (202) 312 6375 

EUROPE / INDIA / AUSTRALIA / 

NEW ZEALAND / REST OF WORLD 

E-mail: ads@science-int.co.uk 
Fax: +44 (0) 1223 326532 

Sarah Lelarge 

Phone: +44 (0) 1223 326527 

Kelly Grace 

Phone: +44 (0) 1223 326528 

Online Job Posting Questions 

Phone: +44 (0) 1223 326528 

lAPAN 

Katsuyoshi Fukamizu (Tokyo) 

E-mail: kfukamizu@aaas.org 
Phone: +81 3 3219 5777 

Hiroyuki Mashiki (Kyoto) 

E-mail: hmashiki@aaas.org 
Phone: +81 75 8231109 

CHINA / KOREA / SINGAPORE / 
TAIWAN / THAILAND 

Ruolei Wu 

Phone: +86 186 0082 9345 
E-mail: rwu@aaas.org 

All ads submitted for publication must comply with 
applicable U.S. and non-U.S. laws. Science reserves 
the right to refuse any advertisement at its sole 
discretion for any reason, including without limitation 
for offensive language or inappropriate content, 
and all advertising is subject to publisher approval. 
Science encourages our readers to alert us to any ads 
that they feel may be discriminatory or offensive. 
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Learn more and 
keep your job search 
out of the cheap seats. 

• Search thousands of job postings 

• Create job alerts based on your criteria 

• Get career advice from our Career Forum experts 

• Download career advice articles and webinars 

• Complete an individual development plan at “myIDP” 

Target your job search using relevant resources 

on SdenceCareers.org. 
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Columbia University 
Medical Center 



College of Physicians and Surgeons 

Chair of the Department of Pharmacology 



The Columbia University Sehool of Medieine seeks to fill a position for the ehair of the 
department of Pharmaeology. This position will be assoeiated with a tenure-traek faeulty 
positions at the Full Professor level. We weleome applieations from outstanding seientists, 
with an international reputation in the fields of pharmaeology, ehemieal biology, or bio- 
ehemistry whose researeh interests lie in the following strategie areas of biomedieal seienee: 

1. Elueidation of pharmaeologieal targets within signal transduetion networks 

2. Chemieal Biology 

3. Systems Pharmaeology and Pharmaeogenomies 

4. Experimental Therapeuties 

Applieants should have a Ph.D., MD., M.D./PhD., or equivalent degree, substantial interna- 
tional stature and expertise in managing large-seale researeh programs, and substantial traek 
reeord of funded researeh in the fields deseribed above. 

The eandidate is expeeted to develop a new innovative direetion for the Columbia University 
Department of Pharmaeology, aligning it with the most reeent trends and direetions in the 
elueidation and eharaeterization of moleeular targets, genomies of drug response, and other 
relevant and pertinent researeh direetions for the field. He or she will also be expeeted to 
develop a vibrant, externally funded, and leading personal researeh program in an area 
related to the above researeh direetions, as well as partieipating in graduate and medieal 
student teaehing. In addition to a generous start-up and reeruitment paekage, an endowed 
ehair is available for this position. 

Please send a eurrieulum vitae, representative reprints, and a statement of eurrent and future 
researeh interests, as well as three letters of reeomendation. Review of applieations will 
eommenee on Deeember 15, 2015. 

Material should be submitted eleetronieally to Janiris Jorge, jj318@cumc.columbia.edu 

Columbia University is an affirmative action/equal opportunity 
employer and welcomes applications from women and minorities. 



irrsouthwestern 

Medical Center 

The Department of Molecular Biology and 
the Hamon Center for Regenerative Science 
and Medicine (CRSM) at the University 
of Texas Southwestern Medical Center 
invite applications for tenure track faculty 
positions at the level of Assistant, Associate, 
or Full Professor. We are seeking interactive 
individuals with strong research programs 
focused on mechanistic aspects of gene 
regulation, cell growth and differentiation, and 
stem cell biology, including the use of cellular 
and animal models to study human disease. 
Attractive recruitment packages, state-of-the- 
art core facilities, and exceptional laboratory 
space are available. UT Southwestern has a 
vibrant graduate program and an atmosphere of 
collegiality and collaboration. Academic rank 
will be awarded commensurate with experience. 

Applicants should submit a curriculum 
vitae containing a summary of past research 
accomplishments, a statement of future 
objectives, and names of three references via 
email to: Dr. Eric N. Olson, Department 
of Molecular Biology, Hamon Center 
for Regenerative Science and Medicine, 
University of Texas Southwestern Medical 
Center, MolBioSearch@UTSouthwestern. 
edu. 

UT Southwestern Medical Center is an 
Affirmative Action/Equal Opportunity 
Employer. Women, minorities, veterans 
and individuals with disabilities are 
encouraged to apply. 



Advance your career with expert 
advice from Science Careers. 




Download Free Career Advice Booklets! 

ScienceCareers.org/booklets 

Featured Topics: 



■ Networking 

■ Industry or Academia 

■ Job Searching 

■ Non-Bench Careers 

■ And More 
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Department of Health and Human Services 
Mationai Institute of National Institutes of Health 

{jenerai Medical lienees National Institute of General Medical Sciences 

Division of Pharmacology, Physiology, and Biological Chemistry 
DIVISION DIRECTOR 

The National Institute of General Medieal Seienees (NIGMS), a major researeh eomponent of the National Institutes of Health (NIH) and the Department of Health and 
Human Serviees (DHHS), is seeking exeeptional eandidates for the position of Direetor, Division of Pharmaeology, Physiology, and Biologieal Chemistry (PPBC). 
Information about PPBC is available at http://www.nigms.nih.gov/about/overview/pages/ppbc.aspx. PPBC has supported many researeh findings that have led to 
improving the moleeular-level understanding of fundamental biologieal proeesses and diseovering approaehes to their eontrol. In FY 2015, the Division of Pharmaeology, 
Physiology, and Biologieal Chemistry had an annual budget of about $405 million and is one of five seientifie divisions within the NIGMS. PPBC is organized into two 
branehes, and has 11 seientifie staff members who serve as program offieers. One braneh is foeused on the development of biologieal eatalysts, ineluding living 
organisms, for the produetion of useful ehemieal eompounds, medieinal or diagnostie agents or probes of biologieal phenomena and the other braneh is foeused on under- 
standing the total body response to injury, ineluding bioehemieal and physiologieal ehanges indueed by trauma, as well as the efieets of dmgs on the body, the body’s 
efifeets on dmgs and how the effeets of dmgs vary fi-om individual to individual. The division direetor reports to the NIGMS direetor and is a member of the NIGMS 
senior leadership team, whieh helps set polieies and priorities for the Institute. 

Researeh supported by PPBC takes a multifaeeted approaeh to problems in pharmaeology, physiology, bioehemistry and biorelated ehemistry that are very basie in nature 
or have implieations for more than one disease eategory. The Direetor of PPBC plans and direets a program of researeh grants and eontraets, analyzes national researeh 
efforts on moleeular-level understanding of fundamental biologieal proeesses, dmg aetion and meehanisms of anesthesia, new methods and targets for dmg diseovery, 
advanees in natural produets synthesis, biologieal eatalysis, elinieal pharmaeology and trauma and bum injury. He/she makes reeommendations to assist the National 
Advisory General Medieal Seienees Couneil or other advisory eommittees or groups, identifying the need for researeh in the areas of pharmaeology, physiology, bio- 
ehemistry and biorelated ehemistry, and advises universities, other eenters of medieal researeh and professional and lay organizations about researeh needs and require- 
ments. This position offers important opportunities to set seientifie priorities, lead ehange and improve the researeh enterprise. 

Qualifications: Candidates must possess an M.D., Ph.D., or equivalent degree in a field relevant to the position. The ideal eandidate will have eonsiderable researeh 
experienee and will possess a broad speetmm of seientifie knowledge related to the NIGMS mission. In addition, eandidates should possess reeognized researeh 
management and leadership abilities. A strong understanding of pharmaeology, physiology, bioehemistry and ehemistry that deepens the understanding of biology is 
desired. The position will be filled under a Title 42 (f) exeepted serviee appointment. 

Salary/Benefits: Salary is eompetitive and will be eommensurate with the experienee of the eandidate. A reemitment or reloeation bonus may be available, and 
reloeation expenses may be paid. A full paekage of Federal Civil Serviee benefits is available, ineluding: retirement, health and life insuranee, long term eare insuranee, 
leave, and a Thrift Savings Plan (40 IK equivalent). The sueeessful eandidate is subjeet to a baekground investigation and finaneial diselosure requirements. 

How to Apply: Applieants must submit a eurrent eurrieulum vitae, bibliography, eopy of degree, and full eontaet details for three referenees. In addition, applieants are 
asked to prepare two statements: a vision statement and a statement that addresses the speeifie qualifieation requirements (please limit the statements to two pages eaeh). 
NIGMS will be aeeepting applieations from September 8, 2015, and plans to have the position open for at least 45 days, but the applieation proeess will not 
elose until a eandidate has been seleeted. Please send your applieation paekage to PPBCDirectorApplicants@nigms.nih.gov. 

You may contact Linda Sarden with questions about this vacancy lsarden@mail.nih.gov or (301) 594-0534. 

HHS and NIH are Equal Opportunity Employers 




Harvard University seeks outstanding 
appiicants for a position as a 
John Harvard Distinguished 
Science Feiiow 



The Fellow will be housed in the FAS Center for Systems Biology and 
affiliated with one of the Life Science departments in FAS. The Center is 
an intensively collaborative, interdisciplinary center, where scientists from 
a wide variety of backgrounds study cellular pathways and networks, with 
the goal of finding general principles underlying the structure, behavior and 
evolution of cells and organisms. The Fellows are independent researchers, 
who have completed a PhD within twelve months of their appointment, 
who receive funding to run a small research group and are appointed for 
a three-year term, with the expectation that it will be extended by two 
years after review. 

For more information about the Fellows program and this position, visit 
our website at http://sysbio.harvard.edu/jobs. 

We are accepting applications until October 15, 2015. References must 
be received by this deadline. Interviews will take place on December 
3-4, 2015. Please submit a curriculum vitae, research proposal (up 
to 5 pages), summary of previous research accomplishments (up to 2 
pages), and PDFs of up to 3 publications. All files must be submitted 
electronically in PDF or Word format. During the application process, 
you will be asked to give the e-mail addresses of at least three colleagues 
who will be contacted automatically with a request to upload a letter of 
recommendation. Please allow at least 1 week for your referees to upload 
their letters. For further questions about the fellow search, please contact 
Linda Kefalas at lkefalas@mcb.harvard.edu. Successful candidates can 
start their position in early 2016. 

Please submit application here: http://academicpositions.harvard.edu/ 
postings/6393 

Applications from, or nominations of, women and minority candidates 
are encouraged. Harvard University is an Affirmative Action/Equal 
Opportunity Employer. 




* 

NOVEL MATERIALS DISCOVERY 



The European Center of Excellence 
(CoE) on Novel Materials Discovery 
(NOMAD) offers 20 positions for 
highly qualified PhD students and 
postdocs 

The NOMAD CoE develops a Materials Encyclopedia 
and Big-Data Analytics tools for materials science and 
engineering. Eight complementary research groups 
of highest scientific standing in computational 
materials science along with four high-performance 
computer centers form the synergetic core of this 
CoE. 

Interested candidates are invited to contact the PI 
they are interested to work with. The list of Pis and 
more details about the available positions can be 
found on our preliminary web site: 

http://NOMAD-CoE.eu/. 
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Our Next Breakthrough IS YOU. 
Lawrence Postdoctoral Fellowship 

The Lawrence Livermore National Laboratory (LLNL) has openings 
available under its Lawrence Fellowship Program. This is a highly desirable 
postdoctoral position that provides freedom to conduct independent, cutting- 
edge research, directed by the candidate, in an area of the candidate’s 
choice. The duration of the Fellowship is up to three years. Typically two 
to four openings are available each year. Fellowships are awarded only 
to candidates with exceptional talent, credentials and a track record of 
research accomplishments. 

Successful candidates will propose and subsequently conduct original 
research in one or more aspects of science relevant to the mission and goals 
of LLNL. Possible scientific areas include: Physics, Applied Mathematics, 
Computer Science, Chemistry, Material Science, Engineering, Environmental 
Science, Atmospheric Science, Geology, Energy, Lasers and Biology. 
Lawrence Fellows may participate in experimental or theoretical work at 
LLNL and will have access to LLNL’s extensive computing facilities and 
specialized laboratory facilities. A senior scientist will be matched to the 
Fellow to serve as a collaborator and mentor. The candidates will receive full 
management and administrative support. The salary is $8, 628/mo. 



FACULTY RECRUITMENT 

Mechanoblology Institute. Singapore 
National University of Singapore 

The Mech a nobiology Institute, Singapore is a multi disciplinary 
institute commuted to developing new paradigms for quantitative 
analysis of biolog icai function. We are seeking outstanding 
candidates for tenure-track faculty positions at all levels viith a 
background in molecular and cell biology, bacterial pathogenesis, 
correlative microscopieSp bioiogy-related engineering^ or 
computational biology^ and a commitment to collaborative 
intcrdisciplinafy research. Wo offer an open lab environment and 
extensive central fadtitEes support. Successful cai^didates will hold 
joint appointments through relevant departments at the National 
Universitv of Sing^re or other Singapore Universities. 

Singapore fosters a rich and vibrant research environment I he 
National Research Foundation of Singapore offers generous research 
grants through the Singapore NRF Feilowships that can be awarded to 
outstanding young scientists of any nationality to support their 
independent research, (www.irf gw sgl 



Please refer to the following web page http://apptrkr.com/634081 for 
eligibility requirements and instructions on how to apply. When applying 
and prompted, please mention where you saw this ad. The deadline for 
applications is November 1, 2015. LLNL is operated by the Lawrence 
Livermore National Security, LLC for the U.S. Department of Energy, National 
Nuclear Security Administration. We are an equal opportunity employer with 
a commitment to workforce diversity. 



Lawrence Livermore 
National Laboratory 



http://fellowship.llnl.gov 



Please submit your application 
along with turriculum vitae full 
publication llst^ research plans 
and names of three erttemal 
referees to : 

••siMBI 

MhriiAirfra m:.ik^v*urn \ ‘I'lnli 



Professor A^ichae I Sheetz 
Pirector 

The hrtechanobliolagy institute. 

SEhpapore 

Natianal Mniversity of Singapore 
T-Lab . S A Engineering Driv^ 1 
Singapore 1 1 741 1 
E-mail ; mbibrttnvs.idu.ig 
Website : mbi.nus.Edu.s9 
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SANF; 5 RD 

RESEARCH 

Postdoctoral Fellow/Staff Scientist - Guo Lab 

The mission of the Sanford Researeh is to develop pediatrie translational 
researeh. Sanford Researeh is aetively using interdiseiplinary approaehes 
to understand the underlying basis of a wide array of eongenital defeets and 
ehildhood diseases. Several speeifie pediatrie diseases are eurrently under 
investigation. A major initiative within the eenter is the Sanford Projeet, 
whieh aims to better understand, treat, and eure type I diabetes. The eenter 
also ineludes expertise in a wide range of diseiplines, ineluding moleeular 
biology, eell biology, developmental biology, bioehemistry, and geneties. 

At the elinieal onset of type 1 diabetes (TID), human patients still 
have some funetional beta eell mass remaining. If we eould eontrol 
autoimmunity against beta eells, TID may be eured by inereasing the 
remaining beta eell mass to restore normoglyeemia. The goal of eurrent 
researeh in Dr. Guo’s laboratory is to develop therapeutie approaehes to 
treating TID by ameliorating autoimmunity and by promoting beta eell 
survival and regeneration. 

Dr. Guo’s laboratory is eurrently seeking a full time Postdoetoral Fellow 
or Staff Seientist to earry out independent seientifie researeh ineluding 
eomplex laboratory testing, experiments and analysis following established 
protoeols. A sueeessful eandidate will write teehnieal summaries and 
eo-author artieles for publieation as well as present formal presentations 
of researeh results. The individual will work under the broad direetion of 
the prineipal investigator. 

Both the Postdoetoral Fellow and Staff Seientist levels require a PhD in 
a elinieal, health or life seienee field or MD with suffieient publieation 
history. Previous relevant experienee is preferred. Staff Seientist level 
also requires a minimum of three years postdoetoral experienee in related 
subjeet field of study. 

To view a job deseription and apply, visit: 

http://www.sanfordresearch.org/careers 
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faculty 

Career Feature 

October 9, 2015 

Reserve ads by September 22 

"^ds.^cepted until October 5 
on a fir^-cqme, first-served basis. 
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on a fir^-cqme, first-served basis. 
For recruitment in science, there’s only one ScicilCe 



Why choose this faculty feature 
for your advertisement? 

■ Relevant ads lead off the career section 
with special Faculty banner 

■ Special distribution to 25.000 
scientists beyond our 
regular circulation. 




Science 

Careers 



SCIENCECAREERS.ORG 



To book your ad: advertise@sciencecareers.org 



The Americas 
202-326-6582 
Europe/RoW 
+44(0)1223-326500 



Japan 

+81-3-3219-5777 

China/Korea/Singapore/Taiwan 

+86-186-0082-9345 






THE BEST 

THE BEST PLACE Beijing, the capital of China 

THE BEST CHANCE Research Feliow/Postdoc/PL. 
THE BEST SALARY Highly compeUtive 




www,bit.ediixn wwwncepu.edu.cn www.buaa.edu.cn www.cnu.edu.cn 



We offer the scientific dreams 
rather than a simple position 
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Far more information, ploase visit www.edu. on/zhaopin 
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POSITIONS OPEN 



ASSISTANT/ASSOCIATE/FULL PROFESSOR- 
Department of Pharmaceutical Sciences 

The College of Pharmacy, Washington State Univer- 
sity (WSU) on the Health Sciences campus in Spokane, 
WA invites applications for a 12 month, full-time, per- 
manent tenure-track faculty position at the rank of As- 
sistant, Associate, or Full Professor (commensurate with 
experience) in the Department of Pharmaceutical Sciences. 
Applicants must have an earned terminal advanced 
degree (Ph.D., PharmD, or M.D. in Pharmacology, Phar- 
maceutical Sciences, or a related discipline) by the date of 
hire. The Assistant Professor candidate must possess a track 
record of accomplishment to demonstrate strong potential 
to become an outstanding scholar and educator. The As- 
sociate Professor candidate must possess a track record of 
accomplishment demonstrating he/she is an outstanding 
scholar and educator with an active, nationally recog- 
nized, extramurally funded, research program; and have 
served at least six years as Assistant Professor or equiv- 
alent. The Professor candidate must possess a track record 
of accomplishment demonstrating he/she is an outstand- 
ing scholar and educator with an active, internationally 
recognized, extramurally-funded, research program; and 
have served at least six years as Associate Professor or 
equivalent. Areas of special research interest include 
molecular targets and biomarkers of disease, pharmacoge- 
nomics, and drug delivery and development. The suc- 
cessful candidate will be expected to maintain an active 
extramurally-funded research program, mentor graduate 
students and fellows, and teach in the graduate courses 
on topics of pharmacology, biochemistry, and/or mo- 
lecular biology and, potentially, professional curricula. 

Screening of applications will begin September 15, 
2015 and will continue until suitable candidates are iden- 
tified. To see complete position announcement and apply, 
please visit website: www.wsujobs.com. Applications must 
include the following materials: (1) A cover letter of ap- 
plication describing short and long-term goals, relevant 
academic preparation, and experience; (2) A current cur- 
riculum vitae; and (3) Name, title, organization, phone 
number and e-mail address for four people willing to 
serve as employment references. 

WSU is an Equal Opportunity / Affirmative Action/ADA 
Educator and Employer. 



The Department of Anatomy & Cell Biology, Brody 
School of Medicine at East Carolina University (ECU) 
is seeking applicants for a tenure-track or fixed-term 
ASSISTANT/ASSOCIATE PROFESSOR position 
in Greenville, North Carolina. Rank will be commen- 
surate with qualifications. We are seeking applicants 
with a strong interest in graduate student education 
and enthusiasm and experience in medical education in 
human gross anatomy and neuroanatomy. Research 
programs must compliment the strengths of ongoing 
research in the department (website: http:/ /www.ecu. 
edu/ anatomy) . 

Candidates must apply using the online application 
process to position #957507 at website: https://ecu. 
peopleadmin.com/applicants/jsp/shared/position/ 
JobDetails_css. jsp . 

ECU is a constituent institution of the University of North 
Carolina and an Equal Opportunity /Affirmative Action Univer- 
sity that accommodates the needs of individuals with disabilities. 



ASSISTANT/ASSOCIATE PROFESSOR 
Ecology and Evolutionary Biology 

The Department of Ecology and Evolutionary Biol- 
ogy, Tulane University, invites applications for one 
TENURE-TRACK POSITION at the level of assistant 
professor or associate professor. See website: http:// 
tulane.edu/ sse/ eebio/about/ positions for details 
about the position, department, and search. Submit letter 
of application, curriculum vitae, statements of research and 
teaching interests, selected publications, and names and 
addresses of three references to: e-mail: apply.interfolio. 
com/30088. Review of applications will begin Octoba* 15, 
2015, and the search will remain open until the posi- 
tion is filled. Tulane is an Equal Opportunity Employer/Minorities/ 
Female/ Veteran/Disabled employer. 



ASSISTANT PROFESSOR 

The Department of Chemistry and Biochemistry 
and Institute of Molecular Biology at the University of 
Oregon (websites: http://chemistry.uoregon.edu; 
http://molbio.uoregon.edu) have an opening for a 
tenure-track biochemistry faculty member at the Assist- 
ant Professor level to begin in Fall 2016 or later. 

The primary selection criteria will be the potential 
for establishing a vigorous independent research pro- 
gram addressing fundamental problems in molecular biol- 
ogy, and excellence in teaching at the undergraduate and 
graduate levels. We will consider applicants working in 
all areas of molecular biology, including those using 
cutting-edge approaches to study the mechanisms of 
cell and genome function, including RNA and protein 
biochemistry. We seek candidates with a demonstrated 
commitment to working effectively with students, faculty 
and staff from diverse backgrounds. 

To assure full consideration, apply at website: https:// 
academiqobsonline.org/ajo/jobs/6024 and upload a 
curriculum vitae, a statement of research plans and ob- 
jectives, and a brief statement of teaching philosophy or 
interests. The candidate should also arrange for three let- 
ters of recommendation to be uploaded directly by the 
recommenders via the preceding UBT link by October 15, 
2015. Review of application materials will continue until 
the position is filled. 

The University of Oregon is an Equal Opportunity, Affirmative 
Action Institution committed to cultural diversity and compliance 
with the ADA. The University encourages all qualified individuals 
to apply, and does not discriminate on the basis of any protected 
status, including veteran and disability status. 



BIOLOGY DEPARTMENT: 
ASSISTANT PROFESSOR OF MICROBIOLOGY 

Code: tenure track Microbiology Faculty Position. 
The Department of Biology at Clark University, 
Worcester Massachusetts website: (wv^w. clarku.edu/ 
departments/biology/) invites applications for a tenure 
track appointment at the rank of Assistant Professor 
to begin Fall 2016. The successful candidate is expected 
to develop an externally- funded research program in 
any field of microbiology, involving PhD, MS, and 
undergraduate students, and will teach microbiology 
and courses in their area of expertise. Clark University 
is especially interested in qualified candidates who can 
contribute, through their research, teaching, and/or 
service, to the diversity and excellence of the academic 
community. 

For full consideration, one electronic file with a 
Curriculum Vitae, research and teaching statements, 
and three key publications should be received prior 
to November 1, 2015, as should three letters of refer- 
ence e-mail: (microbiology@clarku.edu). See website 
http://wwwv.clarku.edu/offices/hr for full position 
description. Inquiries may be directed to Justin Thackeray 
e-mail: (jthackeray@clarku.edu). We are an Affirmative Action/ 
Equal Employment Opportunity University and we strongly en- 
courage women and members of minority groups to apply. 



PRINCETON UNIVERSITY DEPARTMENT 
OF CHEMISTRY ASSISTANT PROFESSOR 

The Department of Chemistry at Princeton Uni- 
versity invites applications for a tenure-track assistant 
professor position in all areas of chemistry. We seek a 
faculty member who will create a climate that embraces 
excellence and diversity with a strong commitment 
to research and teaching that will enhance the work 
of the department and attract and retain a diverse stu- 
dent body. We strongly encourage applications from 
members of all underrepresented groups.Candidates 
are expected to have completed the Ph.D. in chemistry 
or a related held at the time of appointment. Applicants 
should submit a description of research interests, cur- 
riculum vitae, a list of publications, and contact informa- 
tion for three referees online at website: https://jobs. 
princeton. edu /applicants /jsp /shared /position / 
JobDetails_css.jsp. The deadline for applications is 
October 15, 2015. 

Princeton University is an Equal Opportunity Employer. All 
qualified applicants will receive consideration for employment with- 
out regard to race, color, religion, sex, national origin, disability status, 
protected veteran status, or any other characteristic protected by law. 
This position is subject to the University’’ s background check policy. 



ASSISTANT/ASSOCIATE PROFESSOR 
Department of Chemistry 

The Department of Chemistry at Indiana University 
invites applications for a position in our Chemical Biol- 
ogy area beginning August 2016, at the rank of Assistant 
or Associate Professor (tenure-track). The search is open 
to candidates with interests in all areas of chemical biology, 
from structural biology and molecular biophysics, to bio- 
analytical chemistry and metabolic profrling. A Ph.D. in 
chemistry or a related field and postdoctoral experience are 
required. Successful candidates will be expected to develop 
or already have a visible, externally funded research pro- 
gram. Primary teaching assignments will involve graduate- 
level biochemistry and chemical biology and undergraduate 
chemistry and biochemistry. The successful candidate will 
be named a trainer in our MH-fimded Chemistry-Biology 
Interface graduate training program in Quantitative and 
Chemical Biology at Indiana University (website: http:// 
www.chan.indiana.edu/qcb/). Applications completed 
by October 15, 2015 will receive full consideration, but 
the review will continue until the position is filled. Inter- 
ested candidates should review the application requirements 
and submit a complete Curriculum Vitae, a summary of 
future research plans, and arrange to have four letters of 
recommendation at website: https://indiana.peopleadmin. 
com). Questions regarding the position or application pro- 
cess can be directed to: Professor David Giedroc, Chair, 
Faculty Search Committee, Department of Chemistry, 
Indiana Univasity, 800 E. Kirkwood Avenue, Blooming- 
ton, IN 47405, e-mail: chemchair@indiana.edu. 

Indiana University is an Equal Employment and Affirmative 
Action Employer and a provider of ADA services. All qualified ap- 
plicants will receive consideration for employment without regard to 
age, ethnicity, color, race, religion, sex, sexual orientation or identity, 
national origin, disability status or protected veteran status. 

CHEMISTRY 

Assistant, Associate, or Full Professor 

The Department of Chemistry at Indiana University 
invites applications for Tenure/Tenure-track Assistant, 
Associate, or Full Professor position in Materials Chem- 
istry beginning August 2016. The search is open to all 
areas of materials chemistry, including theory, modeling, 
and experimental synthetic approaches to the assembly 
of inorganic and soft materials as well as the application 
of analytical and physical chemistry to materials. Candi- 
dates with cross-disciplinary research interests are strong- 
ly encouraged to apply. A Ph.D. in chemistry or a related 
field and postdoctoral experience are required. Successful 
candidates will be expected to develop or have already de- 
veloped a visible, externally funded research program. Pri- 
mary teaching assignments will include graduate courses 
in materials chemistry and undergraduate chemistry. Ap- 
plications completed by October 15, 2015 will receive 
full consideration, but the review will continue until the 
position is filled. Interested candidates should review the 
application requirements and submit a complete Curric- 
ulum Vitae, a summary of future research plans, and ar- 
range to have four letters of recommendation at the link 
(website: https://indiana.peopleadmin.com). Questions 
regarding the positions or application process can be 
directed to: Professor Sara Skrabalak, Chair, Faculty 
Search Committee, Department of Chemistry, Indiana 
University, 800 E. Kirkwood Avenue, Bloomington, 
IN 47405, e-mail: chemchair@indiana.edu. 

Indiana University is an Equal Employment and Affirmative 
Action Employer and a provider of ADA services. All qualified 
applicants will receive consideration for employment without regard 
to age, ethnicity, color, race, religion, sex, sexual orientation or iden- 
tity, national origin, disability status or protected veteran status. 
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